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cDNA encoding (-)-4S-limonene synthase from spearmint has been isolated and sequenced, and the corresponding amino acid sequence 
has been determined. Accordingly, isolated DNA sequences are provided which code for the expression of limonene synthase, such as 
the sequence designated SEQ ID No. 1 1 which encodes limonene synthase from spearmint (Mentha spicata). In other aspects, replicable 
recombinant cloning vehicles are provided which code for limonene synthase or for a base sequence sufficiently complementary to at least 
a portion of the limonene synthase DNA or RNA to enable hybridization therewith (e.g., antisense limonene synthase RNA or fragments 
of complementary limonene synthase DNA which are useful as polymerase chain reaction primers or as probes for limonene synthase 
or related genes). In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected 
with a recombinant cloning vehicle and/or DNA sequence encoding limonene synthase. Thus, systems and methods are provided for the 
recombinant expression of limonene synthase that may be used to facilitate the production, isolation and purification of significant quantities 
of recombinant limonene synthase (or of the primary enzyme product, limonene) for subsequent use, to obtain expression or enhanced 
expression of limonene synthase in plants to attain enhanced limonene production as a predator defense mechanism, or may be otherwise 
employed for the regulation or expression of limonene synthase or the production of limonene. 
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DNA ENCODING LIMONENE SYNTHASE FROM MENTHA SPICATA 

This invention was supported in part by grant number GM-3 1354 awarded by 
the National Institutes of Health. The government has certain rights in the invention. 

Field of the Invention 

The present invention relates to nucleic acid sequences which code for 
limonene synthase, a monoterpene cyclase enzyme, and to vectors containing the 
sequences, host cells containing the sequences and methods of producing recombinant 
limonene synthase. 

Background of the Invention 
Several hundred naturally occurring, monoterpenes are known, and essentially 
all are bio synthesized from geranyl pyrophosphate, the ubiquitous Cjq intermediate of 
the isoprenoid pathway (Croteau and Cane, Methods of Enzymology 110:383-405 
[1985]; Croteau, Chem. Rev. 87:929-954 [1987]). Monoterpene synthases, often 
referred to as "cyclases," catalyze the reactions by which geranyl pyrophosphate is 
cyclized to the various monoterpene carbon skeletons. These enzymes have received 
considerable recent attention because the cyclization process determines the basic 
character of the monoterpene end-products and because the cyclization mechanism is 
quite complex, involving multiple steps in which many of the carbon atoms of the 
substrate undergo alterations in bonding, hybridization, and configuration (Croteau 
and Cane, Methods of Enzymology 110:383-405 [1985], Croteau, Chem. Rev. 
87:929-954 [1987]). Research on monoterpene cyclases has also been stimulated by 
the possible regulatory importance of these enzymes that function at a branch point in 
isoprenoid metabolism (Gershenzon and Croteau, Biochemistry of the Mevalonic 
Acid Pathway to Terpenoids (Towers and Stafford, eds.), Plenum Press, New York, 
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NY, pp. 99-160 [1990]) as well as by the commercial significance of the essential oils 
(Guenther, The Essential Oils, Vols. HI-VI (reprinted) R E. Krieger, Huntington, NY 
[ 1 972]) and aromatic resins (Zinkel and Russell, Naval Stores: Production, 
Chemistry, Utilization, Pulp Chemicals Association, New York [1989]) and the 
5 ecological roles of these terpenoid secretions, especially in plant defense (Gershenzon 
and Croteau, in "Herbivores: Their Interactions with Secondary Plant Metabolites," 
Vol. I, 2nd Ed. (Rosenthal and Berenbaum, eds.) Academic Press, San Diego, CA, 
pp. 165-219 [1991]; Harborne, in "Ecological Chemistry and Biochemistry of Plant 
Terpenoids," (Harborne and Tomas-Barberan, eds.) Clarendon Press, Oxford, MA, 

10 pp. 399-426 [1991]). 

One of the major classes of plant monoterpenes is the monocyclic /?-menthane 
(l-methyl-4-isopropylcyclohexane) type, found in abundance in members of the mint 
{Mentha) family. The biosynthesis of p-menthane monoterpenes in Mentha species, 
including the characteristic components of the essential oil of peppermint (i.e., 

15 (-)-menthol) and the essential oil of spearmint (i.e., (-)-carvone), proceeds from 
geranyl pyrophosphate via the cyclic olefin (-)-limonene (Croteau, Planta Med. 57 
(suppl):10-14 [1991]), as shown in FIGURE 1. The transformation of geranyl 
pyrophosphate to limonene is seemingly the least complicated terpenoid cyclization 
(Croteau and Satterwhite, J. Biol. Chem. 264:15309-15315 [1989]) in having ample 

20 precedent in solvolytic model studies (Cramer and Rittersdorf, Tetrahedron 
23:3015-3022 [1967]; Haley et al., J. Chem. Soc. C, pp. 264-268 [1969], 
Kobayashi et al., Chem. Lett, pp. 1137-1138 [1976], Vial et al, Tetrahedron 
37:2351-2357 [1981]), and the responsible enzyme has become a prototype for the 
terpenoid cyclization reaction (Cori, Phytochemistry 22:331-341 [1983]; Pauly et al., 

25 Plant Cell Rep. 5:19-22 [1984]; Suga et al., Chem. Lett., pp. 115-118 [1988]; 
Perez et al., Plant Physiol Biochem. 28:221-229 [1990]; Rajaonarivony et al., Arch. 
Biochem. Biophys. 299:77-82 [1992]). The enzyme that produces the (-)-4S- 
enantiomer (geranyl pyrophosphate: (-)-4 i S'-limonene cyclase or, simply, 
(-)_4 J S , _li rno nene synthase) has been purified from peppermint (Mentha x piperita) and 

30 spearmint (Mentha spicata) oil glands (Alonso et al., J. Biol. Chem. 267:7582-7587 
[1992]), and highly specific antibodies directed against this enzyme have been 
prepared (Alonso et al.. Arch. Biochem. Biophys. 301:58-63 [1993]). In properties 
and mechanism of action (Rajaonarivony et al., Arch. Biochem. Biophys. 299:77-82 
[1992]; Rajaonarivony et al, Arch. Biochem. Biophys. 296:49-57 [1992]), it 

35 seemingly catalyzes a slow, possibly rate-limiting, step of monoterpene biosynthesis in 
Mentha (Gershenzon and Croteau, Biochemistry of the Mevalonic Acid Pathway to 
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Terpenoids (Towers and Stafford, eds.), Plenum Press, New York, NY, pp. 99-160 
[1990]). 

A detailed understanding of the control of monoterpene biosynthesis and of 
the cyclase reaction mechanism requires the relevant cDNA clones as tools for 
5 evaluating patterns of developmental and environmental regulation and for examining 
active site structure function relationships. There have been no previous reports of 
the cloning of a monoterpene cyclase although the molecular cloning of a 
sesquiterpene cyclase of plant origin (ep/'-aristolochene synthase from tobacco) has 
recently been reported (Facchini and Chappell, Proc. Natl. Acad. ScL USA 
10 89:11088-11092 [1992]), and the cloning of a diterpene cyclase (casbene synthase 
from castor bean) has also been accomplished. 

Summary of the Invention 
In accordance with the foregoing, cDNA encoding (-)-4 i S r -limonene synthase 
from spearmint has been isolated and sequenced, and the corresponding amino acid 
15 sequence has been determined. Accordingly, the present invention relates to isolated 
DNA sequences which code for the expression of limonene synthase, such as the 
sequence designated SEQ ID No:l 1 which encodes limonene synthase from spearmint 
(Mentha spicata). In other aspects, the present invention is directed to replicable 
recombinant cloning vehicles comprising a nucleic acid sequence, e.g., a DNA 
20 sequence, which codes for limonene synthase or for a base sequence sufficiently 
complementary to at least a portion of the limonene synthase DNA or RNA to enable 
hybridization therewith (e.g., antisense limonene synthase RNA or fragments of 
complementary limonene synthase DNA which are useful as polymerase chain 
reaction primers or as probes for limonene synthase or related genes). In yet other 
25 aspects of the invention, modified host cells are provided that have been transformed, 
transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA 
sequence of the invention. Thus, the present invention provides for the recombinant 
expression of limonene synthase, and the inventive concepts may be used to facilitate 
the production, isolation and purification of significant quantities of recombinant 
30 limonene synthase (or of the primary enzyme product, limonene) for subsequent use, 
to obtain expression or enhanced expression of limonene synthase in plants to attain 
enhanced limonene production as a predator defense mechanism, or may be otherwise 
employed in an environment where the regulation or expression of limonene synthase 
is desired for the production of limonene synthase or the enzyme product, limonene. 
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Brief Description of the Drawings 
The foregoing aspects and many of the attendant advantages of this invention 
will become more readily appreciated as the same becomes better understood by 
reference to the following detailed description, when taken in conjunction with the 
5 accompanying drawings, wherein: 

FIGURE 1 is a schematic representation of the principal pathways of 
monoterpene biosynthesis in spearmint leading to carvone and in peppermint leading 
to menthol. After geranyl pyrophosphate is cyclized to limonene, a series of 
secondary redox transformations convert this olefinic intermediate to other 

1 0 monoterpenes; 

FIGURE 2 shows the amino acid sequences of peptides derived by CNBr 
cleavage and V8 proteolysis of limonene synthase as described in Example 4. Also 
shown are oligonucleotide probes derived from the peptide sequences, as well as 
degenerate nucleotide substitutions in the oligomers. FIGURE 2A shows the peptide 

15 fragment designated CNBr #1 (99-119) (SEQ ID No:l) together with a 
corresponding 20-mer probe (SEQIDNo:4) and degenerate probe codings. 
FIGURE 2B shows the peptide fragment designated CNBr #2 (273-299) 
(SEQEDNo:2) together with a corresponding 23-mer probe (SEQIDNo:5) and 
degenerate probe codings. The solid line over CNBr #2 indicates the overlapping 

20 fragment V8 #1. FIGURE 2C shows the peptide designated CNBr #3 (342-361) 
(SEQ ID No;3) together with a 17-mer probe (SEQ ID No:6) and degenerate probe 
codings; 

FIGURE 3 is a graphical representation of thermal conductivity detector 
response in the radio-GLC separation of limonene from other monoterpene olefins 

25 generated by recombinant limonene synthase, as described in Example 7. Tracing B 
represents the detector response to authentic standards of a-pinene (1), p-pinene (2), 
myrcene (3) and limonene (4). Tracing A represents the detector response to the 
actual products resulting from expression of (4 i S)-limonene synthase. Tracing 3X 
represents a 3X amplified response for the minor components (1), (2) and (3) of 

30 tracing A; 

FIGURE 4 shows the nucleotide (SEQIDNo:8) and predicted amino acid 
sequence (SEQ ID No: 11) of spearmint limonene cyclase clone pLC 5.2, and a 
comparison of the 5-nucleotide sequence of spearmint limonene cyclase clone 
pLC 5.2 with partial sequences from pLC4.1 (SEQEDNo:7), pLC 8.5 
35 (SEQ ID No:9) and pLC 10.1 (SEQ ID No:10). As shown in FIGURE 4, the start 
and stop codons and the Shine-Delgarno sequences are underlined ( ). The 
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sequences of CNBr fragments #1, 2 and 3 (SEQIDNo:l, SEQ ID No:2 and 
SEQ ID No:3, respectively) are double underlined ( = ). Differences in the nucleotide 
sequences between pLC 4.1, 8.5 and 10.1, as compared to pLC 5.2, are indicated by 
lower case letters; 

FIGURE 5 is an immunoblot of crude protein extracts which were subjected 
to SDS-PAGE and then probed with anti-limonene synthase polyclonal antibodies, as 
described in Example 7. The immunoblot lanes are: native limonene synthase from 
spearmint (S); preparations from E. coli alone (XL-1 Blue) and E. coli transformed 
with Bluescript containing no insert (pBlue); and the clones pLC4.1, pLC 5.2, 
pLC 8.5 and pLC 10.1, respectively. The migration of protein standards (in kDa) is 
also indicated; and 

FIGURE 6 shows an amino acid comparison of spearmint limonene cyclase 
(bolded lines, SEQ ED No: 11) with tobacco ep/-aristolochene synthase (bottom) and 
castor bean casbene synthase (top), with amino acid residues 1 through 391 of 
spearmint limonene cyclase being shown in FIGURE 6 A and amino acid residues 392 
through 599 of spearmint limonene cyclase being shown in FIGURE 6B. The vertical 
bars ( | ) in FIGURE 6 mark identical residues. One dot ( . ) and two dots ( : ) 
indicate that two and one point mutations, respectively, are required to produce a 
match with spearmint limonene cyclase. Asterisks are marked above conserved 
histidine and cysteine residues in the three proteins. Putative metal ion-substrate 
complex binding domains are underlined ( _____ ). 

Detailed Description of the Preferred Embodiment 

As used herein, the terms "amino acid" and "amino acids" refer to all naturally 
occurring L-a-arnino acids or their residues. The amino acids are identified by either 
the single-letter or three-letter designations: 



Asp 


D 


aspartic acid 


He 


I 


isoleucine 


Thr 


T 


threonine 


Leu 


L 


leucine 


Ser 


S 


serine 


Tyr 


Y 


tyrosine 


Glu 


E 


glutamic acid 


Phe 


F 


phenylalanine 


Pro 


P 


proline 


His 


H 


histidine 


Gly 


G 


glycine 


Lys 


K 


lysine 


Ala 


A 


alanine 


Arg 


R 


arginine 


Cys 


C 


cysteine 


Trp 


W 


tryptophan 


Val 


V 


valine 


Gin 


Q 


glutamine 


Met 


M 


methionine 


Asn 


N 


asparagine 
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The terms "lirnonene synthase" and "lirnonene cyclase" are used synonymously 
herein to mean the enzyme, particularly from Mentha spicata, capable of catalyzing 
the cyclization of geranyl phosphate to (-)-4.S , -limonene, as described herein. 

The terms "alteration", "amino acid sequence alteration", "variant" and "amino 
5 acid sequence variant" refer to lirnonene synthase molecules with some differences in 
their amino acid sequences as compared to native lirnonene synthase. Ordinarily, the 
variants will possess at least 80% homology with native lirnonene synthase, and 
preferably, they will be at least about 90% homologous with native lirnonene synthase. 
The amino acid sequence variants of lirnonene synthase falling within this invention 

10 possess substitutions, deletions, and/or insertions at certain positions. Sequence 
variants of lirnonene synthase may be used to attain desired enhanced or reduced 
enzymatic activity, altered product distribution such as enhanced production of the 
normally minor co-products ct-pinene, p-pinene and/or myrcine, and the like. 

Substitutional lirnonene synthase variants are those that have at least one 

1 5 amino acid residue in the native lirnonene synthase sequence removed and a different 
amino acid inserted in its place at the same position. The substitutions may be single, 
where only one amino acid in the molecule has been substituted, or they may be 
multiple, where two or more amino acids have been substituted in the same molecule. 
Substantial changes in the activity of the lirnonene synthase molecule may be obtained 

20 by substituting an amino acid with a side chain that is significantly different in charge 
and/or structure from that of the native amino acid. This type of substitution would be 
expected to affect the structure of the polypeptide backbone and/or the charge or 
hydrophobicity of the molecule in the area of the substitution. 

Moderate changes in the activity of the lirnonene synthase molecule would be 

25 expected by substituting an amino acid with a side chain that is similar in charge 
and/or structure to that of the native molecule. This type of substitution, referred to 
as a conservative substitution, would not be expected to substantially alter either the 
structure of the polypeptide backbone or the charge or hydrophobicity of the molecule 
in the area of the substitution. 

30 Insertional lirnonene synthase variants are those with one or more amino acids 

inserted immediately adjacent to an amino acid at a particular position in the native 
lirnonene synthase molecule. Immediately adjacent to an amino acid means connected 
to either the a-carboxy or a-amino functional group of the amino acid. The insertion 
may be one or more amino acids. Ordinarily, the insertion will consist of one or two 

35 conservative amino acids. Amino acids similar in charge and/or structure to the amino 
acids adjacent to the site of insertion are defined as conservative. Alternatively, this 
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invention includes insertion of an amino acid with a charge and/or structure that is 
substantially different from the amino acids adjacent to the site of insertion. 

Deletional variants are those with one or more amino acids in the native 
limonene synthase molecule removed Ordinarily, deletional variants will have one or 
5 two amino acids deleted in a particular region of the limonene synthase molecule. 

The terms "biological activity", "biologically active", "activity" and "active" 
refer to the ability of the limonene synthase molecule to convert geranyl 
pyrophosphate to (-^^-limonene and co-products as measured in an enzyme activity 
assay, such as the assay described in Example 3 below. Amino acid sequence variants 

10 of limonene synthase may have desirable altered biological activity including, for 
example, altered reaction kinetics, product distribution or other characteristics. 

The terms "DNA sequence encoding", "DNA encoding" and "nucleic acid 
encoding" refer to the order or sequence of deoxyribonucleotides along a strand of 
deoxyribonucleic acid. The order of these deoxyribonucleotides determines the order 

15 of amino acids along the polypeptide chain. The DNA sequence thus codes for the 
amino acid sequence. 

The terms "replicable expression vector" and "expression vector" refer to a 
piece of DNA, usually double-stranded, which may have inserted into it a piece of 
foreign DNA. Foreign DNA is defined as heterologous DNA, which is DNA not 

20 naturally found in the host cell. The vector is used to transport the foreign or 
heterologous DNA into a suitable host cell. Once in the host cell, the vector can 
replicate independently of the host chromosomal DNA, and several copies of the 
vector and its inserted (foreign) DNA may be generated. In addition, the vector 
contains the necessary elements that permit translating the foreign DNA into a 

25 polypeptide. Many molecules of the polypeptide encoded by the foreign DNA can 
thus be rapidly synthesized. 

The terms "transformed host cell" and "transformed" refer to the introduction 
of DNA into a cell. The cell is termed a "host cell", and it may be a prokaryotic or a 
eukaryotic cell. Typical prokaryotic host cells include various strains of E. coli. 

30 Typical eukaryotic host cells are plant cells, such as maize cells. The introduced DNA 
is usually in the form of a vector containing an inserted piece of DNA. The 
introduced DNA sequence may be from the same species as the host cell or a different 
species from the host cell, or it may be a hybrid DNA sequence, containing some 
foreign and some homologous DNA. 

35 In accordance with the present invention, cDNA encoding limonene synthase 

was isolated and sequenced in the following manner. -(-)-45'-Limonene synthase is 
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located exclusively in the glandular trichome secretory cells and catalyzes the first step 
of monoterpene biosynthesis in these essential oil species. Known methods for 
selectively isolating secretory cell clusters from these epidermal oil glands 
(Gershenzon et ah, Modern Phytochemical Methods (Fischer et al., Eds.), Plenum 
5 Press, New York, NY, pp. 347-370 [1991]; Gershenzon et al., Anal Biochem. 
200:130-138 [1992]) and for purifying this monomelic cyclase (M r -56,000) from 
these structures (Alonso and Croteau, Methods Plant Biochem. 9:239-260 [1993]; 
Lanznaster and Croteau, Protein Express. Purif. 2:69-74 [1991]) were employed to 
obtain sufficient amounts of protein from spearmint for polyclonal antibody 

10 production (Alonso et al., Arch. Biochem. Biophys., supra) and amino acid sequence 
analysis. Since the amino terminus of the cyclase was blocked, internal fragments 
were generated by CNBr cleavage and V8 proteolysis. Several of these fragments, 
comprising approximately 15% of the protein, were successfully sequenced and 
subsequently shown to span about half of the length of the protein starting from near 

15 the amino terminus (FIGURE 2). From this information three non-overlapping, 
degenerate oligonucleotide probes (20-mer, 23-mer and 17-mer) were identified 
having the degenerate codings: 

20-mer AAYATHGTNCARGCNCARTTYCA (SEQ ED No:5) 

1 7-mer AARGAYATHTAHGAYGT (SEQ ID No:3) 

20 As used herein, nucleotides are represented by the symbols recommended by 

the IUPAC-IUB Biochemical Nomenclature Commission, wherein A is adenine, C is 
cytosine, G is guanine, T is thymine, R is A or G, N is A or G or T or C or other, Y is 
C or T and H is A or C or T. Corresponding probes were prepared and used to 
screen a XZAP cDNA expression library, as described in Example 5, infra. The 
25 probes employed were as follows: 

20-mer GAAAAAGAAACAGATCAGAT (SEQ ED No:4 where 

3R-A, 6R=A, 9N=A, 12R=A, 15Y=T and 18R=G) 

23-mer AATATTGTICAAGCICAATTTCA (SEQ ID No: 5 where 

3Y=T, 6H=T, 9N=inosine, 12R=A, 15N-inosine, 18R=Aand 
30 21Y=T) 

1 7-mer AAGGATATTTACGATGT (SEQ ID No:6 where 

3R=G, 6Y=T, 9H=T, 12Y=C and 15Y=T) 

Four clones, designated pLC 4.1, pLC 5.2, pLC 8.5 and pLC 10.1, hybridized 

to all three oligonucleotides and persisted through four subsequent rounds of 
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screening and purification. The isolated phagemids were in vivo excised, circularized, 
and packaged; these phagemids (plasmids) were then used to transfect E. coli XL1- 
Blue (Skratagene) in accordance with the manufacturer's instructions as described, 
infra. The transfected cultures were induced with IPTG, and the cells from each were 
5 harvested, homogenized and assayed for limonene cyclase activity using 
[l- 3 H]geranyl pyrophosphate as substrate. Preparations from three of the four 
transfected cultures (with pLC 4.1, 5.2 and 8.5) afforded levels of limonene synthase 
activity in the range of 10-50 nmol/culture with specific activities in the range of 
0.5-1 0 nmol/h/mg protein. The plasmid pLC 5.2 has been deposited on October 26, 

10 1993 in the American Type Culture Collection, Rockville, Maryland, USA, under 

accession number ATCC . 

To examine in greater detail the product mixture generated by the recombinant 
limonene cyclases, the proteins encoded by pLC4.1, 5.2 and 8.5 were partially 
purified by anion-exchange chromatography on DEAE-Sepharose. A single olefin 

15 synthase per clone was observed on fractionation, and preparative-scale incubations 
with each were carried out in the presence of saturating concentrations of divalent 
metal ion cofactor (Mg 2+ ) and [l- 3 H]geranyl pyrophosphate. The pentane-soluble 
reaction products were isolated and passed through silica gel to provide the olefin 
fraction Following the addition of authentic monoterpene carriers, this material was 

20 analyzed by radio-GLC as described in Example 8 and shown to contain primarily 
limonene (—94%) with smaller amounts of other olefins (FIGURE 3). Integration of 
the amplified signal gave 1.8% a-pinene, 2.0% (3-pinene and 1.9% myrcene, which is 
identical to the levels of these minor coproducts generated by the limonene synthase 
from spearmint and peppermint (Rajaonarivony et al., Arch. Biochem. Biophys., 

25 supra). Thus, the recombinant proteins produced by pLC 4.5, 5.2 and 8.5 generated, 
with complete fidelity, the same product mixture as the native enzyme. The 
production of multiple products, while seemingly common among monoterpene 
synthases (Croteau, Chem. Rev. , supra, Alonso and Croteau, in Secondary Metabolite 
Biosynthesis and Metabolism (Petroski and McCormick, Eds.), Plenum Press, New 

30 York, NY, pp. 239-251 [1992]), is an unusual enzymatic phenomenon that can be 
verified by isotopically sensitive branching experiments (Alonso and Croteau, in 
Secondary Metabolite Biosynthesis and Metabolism, supra; Wagschal et al. 5 
Teti'ahedron 47:5933-5944 [1991]) but not, with certainty, by either co-purification 
or differential inactivation studies (Rajaonarivony et al., Arch. Biochem. Biophys, 

35 supra, Alonso et al., J. Biol. Chem., supra). The production of multiple products by 
a recombinant cyclase provides the ultimate proof of this unusual catalytic capability. 
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Since the pLC 5.2 cDNA gave the highest apparent specific activity of 
limonene cyclase expression, this isolate was completely sequenced and examined in 
detail (FIGURE 4) The pLC 5.2 limonene synthase cDNA (SEQ ID No: 8) is 2182 
nucleotides in length and contains a complete open reading frame of 1 800 nucleotides. 
5 The deduced amino acid sequence indicates the presence of a putative plastidial transit 
peptide of approximately 89 amino acids (SEQEDNo:ll, residues 1-89) and a 
mature protein of approximately 510 residues (SEQ ED No: 11, residues 90-599). The 
locations of all four peptide sequence fragments obtained from the native protein have 
been identified within the open reading frame of pLC 5.2 (FIGURE 4), confirming the 

1 0 cDNA to represent a limonene synthase gene. 

The transit peptide/mature protein junction and, thus, the exact lengths of both 
moieties are unknown because the amino terminus of the mature protein is blocked 
and has not yet been identified. The approximate size and charge of the leader 
sequence is appropriate for a plastidial transit peptide; the first 89 amino acids are 

1 5 characteristically high in serine and threonine content (-25%); and the Chou-Fasman 
rules (Chou and Fasman, Adv. Enzymol 47:45-147 [1978]) indicate a typical p-sheet 
adjacent to the putative cleavage site (Keegstra et al, Arm. Rev. Plant Physiol Plant 
Mol Biol 40:471-501 [1989], von Heijne et al., Eur, J. Biochem, 180:535-545 
[1989]; von Heijne and Nishikawa, FEBS Lett. 278:1-3 [1991]) which established 

20 criteria (von Heijne, Eur. J. Biochem. 133:17-21 [1983]) predict between or near 
residues Ala 89-Ser 90. The location of the CNBr fragment #1 along the deduced 
amino acid sequence defines the minimum size of the mature peptide since no 
fragments preceding Met 98 were observed. Translation of the putative mature 
limonene synthase cDNA (i.e., SEQIDNo:ll, residues 90-599) yields a protein of 

25 59.8 kD, which is within 10% of the 56 kD mass of the native (mature) enzyme 
estimated by gel permeation chromatography and SDS-PAGE (Alonso et al., J. Biol. 
Chem., supra). Additionally, the deduced amino acid composition of the translated 
pLC 5.2 cDNA open reading frame corresponding to the putative mature peptide 
(amino acid residues 90-599) agrees very well with the amino acid composition of the 

30 native enzyme, as shown in Table 1: 
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Table 1 

Amino acid composition of native Hmonene cyclase compared 

to that of the translated cDNA 





Deduced from 


Acid hydrolyzed 


Amino Acid 


translated cDNA 


native protein 


Ala 


25 


29 


Cys 


5 


ND 


Asp+Asn 


56 


63 


Glu+Gln 


68 


74 


r lie 






Gly 


24 


36 


His 


14 


12 


He 


26 




Lys 


27 


27 


Leu 


55 


58 


Met 


14 


ND 


Pro 


17 


15 


Arg 


30 


26 


Ser 


32 


28 


Thr 


23 


22 


Val 


32 


31 


Trp 


13 


ND 


Tyr 


21 


17 


Total residues 


510 




kD 


59.8 





The translated sequence includes only those amino acids of the putative 
mature peptide, residues #90-599. Asn/Asp and Gln/Glu could not be distinguished, 
and tryptophan, methionine and cysteine could not be determined (ND) from the acid 
hydrolyzed sample. The molecular weight of the native protein was determined by 
SDS-PAGE and gel permeation chromatography to be about 56,000. The x 2 test 
showed interdependence of the two compositions with p>0.97. 

Finally, ultrastructural immunogold cytochemical studies with anti-limonene 
cyclase polyclonal antibodies have recently demonstrated the limonene cyclase of mint 
to be located exclusively in the plastids (leucoplasts) of the glandular trichomes, a 
finding entirely consistent with the presence of such an organellar targeting sequence 
in the limonene synthase cDNA. Although the limonene synthase cDNA set forth in 
SEQ IDNo:8 directs the enzyme to plastids, substitution of the targeting sequence 
(SEQ ID No: 11, residues 1 to 89) with other transport sequences well known in the 
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art (see, e.g., Keegstra et al, supra, von Heijne et al., supra) may be employed to 
direct the limonene synthase to other cellular or extracellular locations. 

Spearmint (M spicata) is a tetraploid and parent of peppermint (M piperita = 
Mentha aquatica x spicata), a hexaploid (Harley and Brighton, Bot. J. Linn. Soc. 
5 74:71-96 [1977]). Both contain ostensibly the same limonene synthase (Alonso et al., 
J. Biol. Chem., supra). RNA blot hybridization of pLC 5.2 insert DNA to spearmint 
and peppermint poly(A) + RNA verified the presence of the homologous sequences in 
both species and provided an estimate of limonene cyclase mRNA transcript size of 
about 2,400 nucleotides (data not shown). Thus, all lines of evidence, including RNA 

10 blot hybridization, indicate that the limonene synthase structural gene from spearmint 
has been successfully isolated. All peptide sequence fragments obtained from the 
native enzyme correspond to sequences identified in the pLC 5.2 open reading frame, 
which is of the correct size and amino acid composition (excluding the putative transit 
peptide) The recombinant enzyme when expressed in E. coli is immunologically 

15 recognized by polyclonal antibodies raised to the purified limonene synthase from 
spearmint and is catalytically active in generating a product distribution identical to 
that of the native cyclase. 

The other full-length clones were partially sequenced and, although the first 
100 nucleotides downstream of the translation start were identical in the cDNA inserts 

20 of pLC 4.1, pLC 5.2 and pLC 8.5, suggesting that they share a common open reading 
frame, several nucleotide differences were observed in the 5 '-untranslated regions of 
all three (FIGURE 4). These differences suggest the presence of several limonene 
synthase genes and/or alleles in the allotetraploid spearmint genome, which has been 
confirmed by DNA blot (Southern blot) hybridization. A sesquiterpene cyclase, 

25 ey?/-aristolochene synthase, from tobacco is encoded by a gene family (Facchini and 
Chappell, Proc. Natl Acad. Sci. USA, supra). Partial sequencing of the clone 
pLC 10.1, that was inactive in limonene synthase expression, revealed a truncated 
transit peptide and several point mutations in the coding region (FIGURE 4). 
Additionally, the open reading frame of pLC 10.1 was out of frame with the lacZ 

30 translation initiation site. However, this complication alone is insufficient to explain 
the lack of expression since all of the other (full-length) cDNAs were also out of 
frame but nevertheless afforded functional expression of the limonene synthase. 

Based on the length of the full open reading frame, including the putative 
transit peptide, the transfected cultures should express a protein of at least 69 kD; 

35 however, immunoblot analysis of total cell extracts from all of the functional cultures 
as described in Example 8 exhibited no detectable antigen of this molecular weight but 



WO 95/11913 



-13- 



PCT/US94/12379 



rather a single major peptide at 54 kD (compared to native enzyme at 56 kD) 
(FIGURE 5). The most plausible explanation for the origin of this immunopositive 
54 kD peptide (and the larger peptides of lower abundance) is that translation 
reinitiation or run-through translation occurs. Proteolytic processing of the immature 
5 peptide by E. coli enzymes may also take place, much the same way as in spearmint 
plastids. Several leader peptidases of E. coli are similar to higher plant plastidal 
transit peptidase (Gottesman et al., Proc. Natl Acad Set USA 87:3513-3517 [1990], 
Halpin et al., EMBO J. 8:3917-3922 [1989]) and such processing of a recombinant 
plastid-directed enzyme has been described previously (Meadows and Robinson, 

10 Plant Mol Biol 17:1241-1243 [1991]). As observed in preliminary immunoblots, 
E. coli. bearing Bluescript II SK(+) alone or cDNA pLC 10.1 did not express 
recognizable cyclase antigen using the purified antibody preparation (FIGURE 5). 
This evidence indicates that although the cDNA is capable of being translated into 
proteins up to 69 kD, peptides as short as 59 kD are still biologically active as 

1 5 limonene synthase. 

Homology with other terpene cyclases - Searches of GenPept, PIR and 
SwissProt databases (Altschul et al., J. Mol Biol 215:403-410 [1990]) revealed no 
sequences with significant similarity to limonene synthase, nor did the sequence of 
limonene synthase resemble those of any of the microbial sesquiterpene cyclases that 

20 have been recently determined, including the Fusarium and Gibberella trichodiene 
synthases (Hohn and Beremand, Gene 79:131-138 [1989]; Hohn and Desjardins, Mol 
Plant-Microbe Interact 5:249-256 [1992]), the Penicillium artistolochene synthase 
(Proctor and Hohn, J. Biol Chem. 268:4543-4548 [1993]) and the streptomyces 
pentalenene synthase. However, sequence comparison (Devereaux et al., Nucleic 

25 Acids Pes. 12:387-395 [1984]) to other terpene cyclases from higher plants revealed 
a significant degree of sequence similarity. The limonene synthase from spearmint 
showed 34% identity and 56% similarity (based on conservative amino acid 
substitutions) to a sesquiterpene cyclase, ep/'-aristolochene synthase, from tobacco 
(Facchini and Chappell, Proc. Natl Acad. Sci. USA, supra), and 31% identity and 

30 53% similarity to a diterpene cyclase, casbene synthase, from castor bean (as shown in 
FIGURE 6). These plant terpenoid synthases represent three different types of 
cyclases from three diverse families, a monoterpene cyclase from the Lamiaceae, a 
sesquiterpene cyclase from the Solanaceae and a diterpene cyclase from the 
Euphorbiaceae suggesting a common ancestry for this class of enzymes. 

35 Recent evidence has implicated histidine and cysteine residues at the active site 

of limonene synthase and of several other monoterpene and sesquiterpene cyclases 
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(Rajaonarivony et al., Arch. Biochem. Biophys, supra). A search of the aligned 
sequences of limonene synthase, e/w-aristolochene synthase, and casbene synthase 
revealed the presence of four such conserved residues, positioned at amino acids 124, 
167, 251 and 516 of limonene cyclase (SEQ ED No:l 1; represented by * locations in 
5 FIGURE 6). 

The limonene synthase sequence does not resemble any of the published 
sequences for prenyltransferases (Clarke et al., Mol Cell Biol. 7:3138-3146 [1987]; 
Anderson et al., J. Biol Chem. 264:19176-19184 [1989]; Ashby and Edwards, J. 
Biol Chem. 265:13157-13164 [1990]; Carattoli et al., J. Biol Chem. 266:5854-5859 

10 [1991]; Fujisaki et al., J. Biochem. 108:995-1000 [1990]), a group of enzymes that, 
like the terpenoid cyclases, employ allylic pyrophosphate substrates and exploit similar 
electrophilic reaction mechanisms (Poulter and Rilling, supra). The sequence 
(I,L,V)XDDXXD occurs in two of the three homologous domains of several 
prenyltransferases of diverse origin (Math et al., Proc. Natl Acad. Sci. USA 

15 89:6761-6764 [1992]), and it has been suggested that these aspartate-rich elements 
function in binding the divalent metal ion complexed pyrophosphate moiety of the 
prenyl substrates (Ashby et al, in Molecular Biology of Atherosclerosis (Attie, Ed.), 
Elsevier, Amsterdam, pp. 27-34 [1990]). The terpenoid cyclases would be expected 
to exhibit similar substrate binding requirements, and most (Facchini and Chappell, 

20 supra; Hohn and Beremand, supra, Hohn and Desjardins, supra), but not all (Proctor 
and Hohn, supra), sesquiterpene cyclase sequences contain this motif. The deduced 
limonene cyclase peptide sequence contains the element VIDDIYD at SEQ ID No: 1 1 
residues 350-356 and the related sequence VDDTSYD at SEQ ID No: 11 
residues 397-403. The former motif is strongly conserved in the three plant-derived 

25 cyclase genes, but the latter is not, and neither is near the four conserved histidine and 
cysteine residues. Cane (Genetics and Biochemistry of Antibiotic Production (Vining 
and Stuttard, Eds.), Butterworth-Heinemann, Stoneham, MA, in press [1993]) has 
urged caution in interpreting the functional role of these aspartate-rich motifs and has 
pointed out an additional motif in prenyltransferases and cyclases that is rich in basic 

30 amino acids and that has been implicated at the active site of farnesyl pyrophosphate 
synthase (Brems et al., Biochemistry 20:3711-3718 [1981]). Similar patches of 
charged amino acids have been described in other terpenoid synthases 
(Chamovitz et al., FEBS Lett. 296:305-311 [1992]), but neither region has very close 
analogy in the limonene synthase sequence. 

35 The isolation of the limonene cyclase cDNA permits the development of an 

efficient expression system for this functional enzyme with which such detailed 
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mechanistic studies can be undertaken. The limonene cyclase cDNA also provides a 
useful tool for isolating other monoterpene cyclase genes and for examining the 
developmental regulation of monoterpene biosynthesis. 

In addition to the native limonene synthase amino acid sequence of 
5 SEQ ED No: 1 1 encoded by the DNA sequence of pLC 5.2 (SEQ ED No: 8), sequence 
variants produced by deletions, substitutions, mutations and/or insertions are intended 
to be within the scope of the invention except insofar as limited by the prior art. 

The limonene synthase amino acid sequence variants of this invention are 
preferably constructed by mutating the DNA sequence that encodes wild-type 

10 limonene synthase. Generally, particular regions or sites of the DNA will be targeted 
for mutagenesis, and thus the general methodology employed to accomplish this is 
termed site-directed mutagenesis. The mutations are made using DNA modifying 
enzymes such as restriction endonucleases (which cleave DNA at particular 
locations), nucleases (which degrade DNA) and/or polymerases (which synthesize 

15 DNA). 

The potential of site-directed mutagenesis to reveal active site structure- 
function relationships is considerable because of the power of the technique, the 
unique catalytic properties of the monoterpene cyclases, and the large number of 
mechanistic probes available from the sequences disclosed herein to examine the 

20 reaction carried out by this enzyme type. Active site cysteine and histidme residues, 
the conserved DDXXD motif (see SEQ ID No: 11 residues 350-356), and putative 
deprotonation bases located by the mechanism-based inhibitor are potential targets for 
this approach, although other sites may be targeted using these techniques. Similarly, 
ongoing research with the sesquiterpene and diterpene cyclases and prenyltransferases 

25 may indicate additional conserved elements that could be targeted for mutagenesis. 
Photoaffinity probe techniques may also be used to locate the presumptive 
hydrophobic domain(s) of the monoterpene cyclases that likely will span a significant 
number of residues. 

A His*Tag pET system (Novagen) as described below may be used for 
30 overexpression and protein purification, and can very conveniently carry out the 
mutagenesis studies in this expression plasmid using a two primer system such as the 
Transformer Site-Directed Mutagenesis kit from Clontech. Following denaturation of 
the target plasmid, two primers are simultaneously annealed to the plasmid; one of 
these primers contains the desired site-directed mutation, the other contains a 
35 mutation at another point in the plasmid resulting in elimination of a restriction site. 
Second strand synthesis is then carried out, tightly linking these two mutations, and 
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the resulting plasmids are transformed into a mutS strain of E. coli. Plasmid DNA is 
isolated from the transformed bacteria, restricted with the relevant restriction enzyme 
(thereby linearizing the unmutated plasmids), and then retransformed into E. coli. 
This system allows for generation of mutations directly in the pET expression 
5 plasmid, without the necessity of subcloning or generation of single-stranded 
phagemids The tight linkage of the two mutations and the subsequent linearization of 
unmutated plasmids results in high mutation efficiency and allows minimal screening. 
Following synthesis of the initial restriction site primer, this method requires the use 
of only one new primer type per mutation site. Rather than prepare each positional 

10 mutant separately, a set of "designed degenerate" oligonucleotide primers can be 
synthesized in order to introduce all of the desired mutations at a given site 
simultaneously. Transformants can be screened by sequencing the plasmid DNA 
through the mutagenized region to identify and sort mutant clones. Each mutant 
DNA can then be restricted and analyzed by electrophoresis on Mutation Detection 

15 Enhancement gel (J.T. Baker) to confirm that no other alterations in the sequence 
have occurred (by band shift comparison to the unmutagenized control). 

In the case of the hydrophobic cleft of the cyclases, a number of residues may 
be mutagenized in this region. Directed mutagenesis can also be used to create 
cassettes for saturation mutagenesis. Once a hydrophobic segment of the active site is 

20 identified, oligonucleotide-directed mutagenesis can be used to create unique 
restriction sites flanking that region to allow for the removal of the cassette and the 
subsequent replacement with synthetic cassettes containing any number of mutations 
within. This approach can be carried out with any plasmid : without need for 
subcloning or generation of single-stranded phagemids. 

25 The verified mutant duplexes in the pET overexpression vector can be 

employed to transform R coli such as strain E. coli BL21(DE3)pLysS, for high level 
production of the mutant protein, and purification by metal ion affinity 
chromatography and thrombin proteolysis. The method of FAB-MS mapping can be 
employed to rapidly check the fidelity of mutant expression. This technique provides 

30 for sequencing segments throughout the whole protein and provides the necessary 
confidence in the sequence assignment. In a mapping experiment of this type, protein 
is digested with a protease (the choice will depend on the specific region to be 
modified since this segment is of prime interest and the remaining map should be 
identical to the map of unmutagenized protein). The set of cleavage fragments is 

35 fractionated by microbore HPLC (reversed phase or ion exchange, again depending 
on the specific region to be modified) to provide several peptides in each fraction, and 
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the molecular weights of the peptides are determined by FAB-MS. The masses are 
then compared to the molecular weights of peptides expected from the digestion of 
the predicted sequence, and the correctness of the sequence quickly ascertained. 
Since this mutagenesis approach to protein modification is directed, sequencing of the 
5 altered peptide should not be necessary if the MS agrees with prediction. If necessary 
to verify a changed residue, CAD-tandem MS/MS can be employed to sequence the 
peptides of the mixture in question, or the target peptide purified for subtractive 
Edman degradation or carboxypeptidase Y digestion depending on the location of the 
modification. 

10 In the design of a particular site directed mutagenesis, it is generally desirable 

to first make a non-conservative substitution (e.g., Ala for Cys, His or Glu) and 
determine if activity is greatly impaired as a consequence. The properties of the 
mutagenized protein are then examined with particular attention to the kinetic 
parameters of K m and k cat as sensitive indicators of altered function, from which 

15 changes in binding and/or catalysis per se may be deduced by comparison to the 
native cyclase. If the residue is by this means demonstrated to be important by 
activity impairment, or knockout, then conservative substitutions can be made, such 
as Asp for Glu to alter side chain length, Ser for Cys, or Arg for His. For 
hydrophobic segments, it is largely size that we will alter, although aromatics can also 

20 be substituted for alkyl side chains. The monoterpene cyclases have the intrinsic 
ability to construct multiple products. Thus, changes in the normal product 
distribution can indicate which step(s) of the reaction sequence have been altered by 
the mutation. For example, an increase in alcohols relative to olefins could indicate 
modified ionization and/or termination (solvent capture vs. deprotonation). A change 

25 in the ratio of acyclics to cyclics could implicate an alteration of the cyclization step 
relative to isomerization (rate data must also be considered here). Modification of the 
hydrophobic pocket can be employed to change binding conformations for substrate 
and intermediates and result in altered isomer composition reflecting regiochemical 
and/or stereochemical change. With an indication of the reaction step that has been 

30 compromised, other substrate and intermediate analogues and inhibitors can be 
employed to examine ionization, isomerization, cyclization, ion-pairing, 
rearrangements, termination steps and other mechanistic features of the native 
cyclases. Analogues can also be used to evaluate (by inhibition studies) the principal 
binding determinants, and any changes in binding parameters could be interpreted by 

35 this means. Equilibrium dialysis and mechanism-based inhibitors can also be used to 
evaluate binding phenomena in catalytically impaired mutants. For some mutants, 
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alteration in the interaction with divalent metal ion cofactors can be revealing, as can 
change in pH optima that may reflect pk of active site acids or bases. These 
techniques can be supplemented with X-ray structural investigations, since 
mutagenesis alone may not distinguish between residues that are essential for catalysis 
5 and residues that are important in maintaining the overall structure of the enzyme. 

Other site directed mutagenesis techniques may also be employed with the 
nucleotide sequences of the invention. For example, restriction endonuclease 
digestion of DNA followed by ligation may be used to generate deletions, as 
described in section 15.3 of Sambrook et al. {Molecular Cloning: A Laboratory 

10 Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, New York, NY [1989]). To 
use this method, it is preferable that the foreign DNA be inserted into a plasmid 
vector. A restriction map of both the foreign (inserted) DNA and the vector DNA 
must be available, or the sequence of the foreign DNA and the vector DNA must be 
known. The foreign DNA must have unique restriction sites that are not present in 

1 5 the vector. Deletions are then made in the foreign DNA by digesting it between these 
unique restriction sites, using the appropriate restriction endonucleases under 
conditions suggested by the manufacturer of the enzymes. If the restriction enzymes 
used create blunt ends or compatible ends, the ends can be directly ligated together 
using a ligase such as bacteriophage T4 DNA ligase and incubating the mixture in the 

20 presence of ATP and ligase buffer as described in section 1.68 of Sambrook et al., 
supra. If the ends are not compatible, they must first be made blunt by using the 
Klenow fragment of DNA polymerase I or bacteriophage T4 DNA polymerase, both 
of which require the four deoxyribonucleotide triphosphates to fill-in the overhanging 
single-stranded ends of the digested DNA. Alternatively, the ends may be blunted 

25 using a nuclease such as nuclease SI or mung-bean nuclease, both of which function 
by cutting back the overhanging single strands of DNA. The DNA is then religated 
using a ligase. The resulting molecule is a limonene synthase deletion variant. 

A similar strategy may be used to construct insertion variants, as described in 
section 15.3 of Sambrook et ah, supra. After digestion of the foreign DNA at the 

30 unique restriction site(s), an oligonucleotide is ligated into the site where the foreign 
DNA has been cut. The oligonucleotide is designed to code for the desired amino 
acids to be inserted and additionally has 5' and 3' ends that are compatible with the 
ends of the foreign DNA that have been digested, such that direct ligation is possible. 

Oligonucleotide-directed mutagenesis may be employed for preparing 

35 substitution variants of this invention. It may also be used to conveniently prepare the 
deletion and insertion variants of this invention. This technique is well known in the 
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art as described by Adelman et al. {DNA, 2:183 [1983]). Generally, oligonucleotides 
of at least 25 nucleotides in length are used to insert, delete or substitute two or more 
nucleotides in the limonene synthase molecule. An optimal oligonucleotide will have 
12 to 15 perfectly matched nucleotides on either side of the nucleotides coding for the 
5 mutation. This ensures that the oligonucleotide will hybridize properly to the single- 
stranded DNA template molecule. The oligonucleotides are readily synthesized using 
techniques well known in the art such as that described by Crea et al. (Proc. Natl 
Acad. Sci. USA, 75:5765 [1978]), specifically incorporated by reference. 

The DNA template molecule is the single-stranded form of the vector with its 

10 wild-type cDNA limonene synthase insert. The single- stranded template can only be 
generated by those vectors that are either derived from bacteriophage Ml 3 vectors 
(the commercially available M13mpl8 and M13mpl9 vectors are suitable), or those 
vectors that contain a single-stranded phage origin of replication as described by 
Veira et al {Meth. Enzymol^ 153:3 [1987]). Thus, the cDNA limonene synthase that 

15 is to be mutated may be inserted into one of these vectors in order to generate single- 
stranded template. Production of the single-stranded template is described in 
sections 4.21-4.41 of Sambrook et al., supra. 

To mutagenize the wild-type limonene synthase, the oligonucleotide is 
annealed to the single-stranded DNA template molecule under suitable hybridization 

20 conditions. A DNA polymerizing enzyme, usually the Klenow fragment of E. coli 
DNA polymerase I, is then added. This enzyme uses the oligonucleotide as a primer 
to complete the synthesis of the mutation-bearing strand of DNA. Thus, a 
heteroduplex molecule is formed such that one strand of DNA encodes the wild-type 
limonene synthase inserted in the vector, and the second strand of DNA encodes the 

25 mutated form of limonene synthase inserted into the same vector. This heteroduplex 
molecule is then transformed into a suitable host cell, usually a prokaryote such as E. 
coli JM101. After growing the cells, they are plated on to agarose plates and 
screened using the oligonucleotide primer radiolabeled with 32-P to identify the 
colonies that contain the mutated limonene synthase. These colonies are selected, and 

30 the DNA is sequenced to confirm the presence of mutations in the limonene synthase 
molecule. 

Mutants with more than one amino acid substituted may be generated in one 
of several ways. If the amino acids are located close together in the polypeptide 
chain, they may be mutated simultaneously using one oligonucleotide that codes for 
35 all of the desired amino acid substitutions If however, the amino acids are located 
some distance from each other (separated by more than ten amino acids, for example) 
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it is more difficult to generate a single oligonucleotide that encodes all of the desired 
changes. Instead, one of two alternative methods may be employed. In the first 
method, a separate oligonucleotide is generated for each amino acid to be substituted. 
The oligonucleotides are then annealed to the single-stranded template DNA 
5 simultaneously, and the second strand of DNA that is synthesized from the template 
will encode all of the desired amino acid substitutions. The alternative method 
involves two or more rounds of mutagenesis to produce the desired mutant. The first 
round is as described for the single mutants: wild-type limonene synthase DNA is 
used for the template, an oligonucleotide encoding the first desired amino acid 

10 substitution(s) is annealed to this template, and the heteroduplex DNA molecule is 
then generated. The second round of mutagenesis utilizes the mutated DNA 
produced in the first round of mutagenesis as the template Thus, this template 
already contains one or more mutations. The oligonucleotide encoding the additional 
desired amino acid substitution(s) is then annealed to this template, and the resulting 

1 5 strand of DNA now encodes mutations from both the first and second rounds of 
mutagenesis. This resultant DNA can be used as a template in a third round of 
mutagenesis, and so on. 

Prokaryotes are the preferred host cells for the initial cloning steps of this 
invention. They are particularly useful for rapid production of large amounts of DNA, 

20 for production of single-stranded DNA templates used for site-directed mutagenesis, 
for screening many mutants simultaneously, and for DNA sequencing of the mutants 
generated. Suitable prokaryotic host cells include E. coli K12 strain 294 (ATCC 
number 31,446), & coli strain W3 1 10 (ATCC number 27,325) K coli XI 776 (ATCC 
number 31,537), and E. coli B; however many other strains of E. coli, such as 

25 HB101, JM101, NM522, NM538, NM539, and many other species and genera of 
prokaryotes including bacilli such as Bacillus subtilis, other enterobacteriaceae such 
as Salmonella typhimurium or Serratia marcesans, and various Pseudomonas species 

may all be used as hosts. 

As a representative example, cDNA sequences encoding limonene synthase 

30 may be transferred to the (His) 6 »Tag pET vector commercially available (from 
Novagen) for overexpression in E. coli as heterologous host. This pET expression 
plasmid has several advantages in high level heterologous expression systems. The 
desired cDNA insert is ligated in frame to plasmid vector sequences encoding six 
histidines followed by a highly specific protease recognition site (thrombin) that are 

35 joined to the amino terminus codon of the target protein. The histidine "block" of the 
expressed fusion protein promotes very tight binding to immobilized metal ions and 
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permits rapid purification of the recombinant protein by immobilized metal ion affinity 
chromatography. The histidine leader sequence is then cleaved at the specific 
proteolysis site by treatment of the purified protein within thrombin, and the limonene 
synthase again purified by immobilized metal ion affinity chromatography, this time 
5 using a shallower imidazole gradient to elute the recombinant synthase while leaving 
the histidine block still adsorbed. This overexpression-purification system has high 
capacity, excellent resolving power and is fast, and the chance of a contaminating 
E. coli protein exhibiting similar binding behavior (before and after thrombin 
proteolysis) is extremely small. 

1 0 The pET vector also contains a consensus ribosome binding site that directs 

efficient translation of the desired protein, and a strong T7 polymerase promoter to 
dnve transcription of the cyclase gene. The T7 RNA polymerase gene is located on 
the bacterial host chromosome and its transcription is driven by the IaclJVS promoter 
(the E. coli host strain BL21(DE3)pLysS is also commercially available from 

1 5 Novagen). Upon EPTG induction, this overexpression system is capable of producing 
10-90% of total E. coli protein as recombinant protein. In addition, the pLysS cell 
line maintains target genes transcriptionally silent prior to EPTG induction, allowing 
for expression of potentially toxic genes. 

As will be apparent to those skilled in the art, other plasmid vectors containing 

20 replicon and control sequences that are derived from species compatible with the host 
cell may also be used in the practice of the invention. The vector usually has a 
replication site, marker genes that provide phenotypic selection in transformed cells, 
one or more promoters, and a polylinker region containing several restriction sites for 
insertion of foreign DNA. Plasmids typically used for transformation of E. coli 

25 include pBR322, pUC18, pUC19, pUCI18, pUC119, and Bluescript M13, all of 
which are described in sections 1.12-1.20 of Sambrook et al., supra However, many 
other suitable vectors are available as well. These vectors contain genes coding for 
ampicillin and/or tetracycline resistance which enables cells transformed with these 
vectors to grow in the presence of these antibiotics. 

30 The promoters most commonly used in prokaryotic vectors include the 

p-lactamase (penicillinase) and lactose promoter systems (Chang et al. Nature, 
375:615 [1978]; Itakura et a!., Science, 198:1056 [1977]; GoeddeletaL, Nature, 
281:544 [1979]) and a tryptophan (trp) promoter system (Goeddel et al, NucL Acids 
Res., 8:4057 [1980]; EPO Appl. Publ. No. 36,776), and the alkaline phosphatase 

35 systems. While these are the most commonly used, other microbial promoters have 
been utilized, and details concerning their nucleotide sequences have been published, 
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enabling a skilled worker to ligate them functionally into plasmid vectors (see 
Siebenlist et ah, Cell, 20:269 [1980]). 

Eukaryotic microbes such as yeasts may be used to practice this invention. 
The baker's yeast Saccharomyces cerevisiae, is a commonly used eukaryotic 
5 microorganism, although several other strains are available. The plasmid YRp7 
(Stinchcomb et al., Nature, 282:39 [1979]; Kingsman et al., Gene, 7:141 [1979]; 
Tschemper et al., Gene, 10:157 [1980]) is commonly used as an expression vector in 
Saccharomyces. This plasmid contains the trpl gene that provides a selection marker 
for a mutant strain of yeast lacking the ability to grow in tryptophan, such as strains 

10 ATCC No. 44,076 and PEP4-1 (Jones, Genetics, 85:12 [1977]). The presence of the 
trpl lesion as a characteristic of the yeast host cell genome then provides an effective 
environment for detecting transformation by growth in the absence of tryptophan. 

Suitable promoting sequences in yeast vectors include the promoters for 
3-phosphoglycerate kinase (Hitzeman et al., J. Biol Chem., 255:2073 [1980]) or 

15 other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg., 7:149 [1968]; 
Holland et al, Biochemistry, 17:4900 [1978]), such as enolase, glyceraldehyde-3- 
phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, 
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, 
triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. In the 

20 construction of suitable expression plasmids, the termination sequences associated 
with these genes are also ligated into the expression vector 3' of the sequence desired 
to be expressed to provide polyadenylation of the mRNA and termination. Other 
promoters that have the additional advantage of transcription controlled by growth 
conditions are the promoter region for alcohol dehydrogenase 2, isocytochrome C, 

25 acid phosphatase, degradative enzymes associated with nitrogen metabolism, and the 
aforementioned glyceraldehyde-3 -phosphate dehydrogenase, and enzymes responsible 
for maltose and galactose utilization. Any plasmid vector containing yeast-compatible 
promoter, origin of replication and termination sequences is suitable. 

Cell cultures derived from multicellular organisms and multicellular organisms, 

30 such as plants, may be used as hosts to practice this invention. For example, 
transgenic plants can be obtained such as by transferring plasmids constructed in 
E. coli that encode limonene synthase and preferably a selectable marker gene, e.g., 
the kan gene encoding resistance to kanamycin, into Agrobacterium tumifaciens 
containing a helper Ti plasmid as described in Hoeckema et al, Nature 303:179-181 

35 [1983] and culturing the Agrobacterium cells with leaf slices of the plant to be 
transformed as described by An et al., Plant Physiology 81:301-305 [1986]. 
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Transformed plant calli may be selected through the selectable marker by growing the 
cells on a medium containing, e.g., kanamycin, and appropriate amounts of 
phytohormone such as naphthalene acetic acid and benzyladenine for callus and shoot 
induction. The plant cells may then be regenerated and the resulting plants transferred 
5 to soil using techniques well known to those skilled in the art. 

In addition, a gene regulating limonene synthase production can be 
incorporated into the plant along with a necessary promoter which is inducible. In the 
practice of this embodiment of the invention, the native promoter region for limonene 
synthase is removed from a cloned gene encoding it and is replaced with a promoter 
10 that only responds to a specific external or internal stimulus. Thus, the gene will not 
be transcribed except in response to the specific stimulus. As long as the gene is not 
being transcribed, its gene product is not produced (nor is the corresponding product, 
limonene). 

An illustrative example of a responsive promoter system that can be used in 

15 the practice of this invention is the glutathione-S-transferase (GST) system in maize. 
GSTs are a family of enzymes that can detoxify a number of hydrophobic electrophilic 
compounds that often are used as pre-emergent herbicides (Weigand et al., Plant 
Molecular Biology 7:235-243 [1986]). Studies have shown that the GSTs are 
directly involved in causing this enhanced herbicide tolerance. This action is primarily 

20 mediated through a specific 1.1 kb mRNA transcription product. In short, maize has 
a naturally occurring quiescent gene already present that can respond to GSTs and 
that can be induced to produce a gene product. This gene has previously been 
identified and cloned. Thus, in one embodiment of this invention, the promoter is 
removed from the GST responsive gene and attached to a limonene synthase gene that 

25 previously has had its native promoter removed. This engineered gene is the 
combination of a promoter that responds to an external chemical stimulus and a gene 
responsible for successful production of limonene synthase. 

In addition to the methods described above, several methods are known in the 
art for transferring cloned DNA into a wide variety of plant species, including 

30 gymnosperms, angiosperms, monocots and dicots (see, e.g., Glick and Thompson, 
eds., Methods in Plant Molecular Biology, CRC Press, Boca Raton, Florida, 1993). 
Representative examples include electroporation-facilitated DNA uptake by 

protoplasts (Rhodes et al., Science 240: [1988]); treatment of protoplasts with 

polyethylene glycol (Lyznik et al., Plant Molecular Biology 13:151-161 [1989]); and 

35 bombardment of cells with DNA laden microprojectiles (Klein et al., Plant Physiol. 
91:440-444 [1989]) and Klein et al., Bio/Technology 6: [1988]), all incorporated 
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by reference. Minor variations make these technologies applicable to a broad range of 
plant species. 

Each of these techniques has advantages and disadvantages. In each of the 
techniques, DNA from a plasmid is genetically engineered such that it contains not 
5 only the gene of interest, but also selectable and screenable marker genes. A 
selectable marker gene is used to select only those cells that have integrated copies of 
the plasmid (the construction is such that the gene of interest and the selectable and 
screenable genes are transferred as a unit). The screenable gene provides another 
check for the successful culturing of only those cells carrying the genes of interest. A 
1 0 commonly used selectable marker gene is neomycin phosphotransferase II (NPT II). 
This gene conveys resistance to kanamycin, a compound that can be added directly to 
the growth media on which the cells grow. Plant cells are normally susceptible to 
kanamycin and, as a result, die. The presence of the NPT II gene overcomes the 
effects of the kanamycin and each cell with this gene remains viable. Another 
1 5 selectable marker gene which can be employed in the practice of this invention is the 
gene which confers resistance to the herbicide glufosinate (Basta). A screenable gene 
commonly used is the p-glucuronidase gene (GUS). The presence of this gene is 
characterized using a histochemical reaction in which a sample of putatively 
transformed cells is treated with a GUS assay solution After an appropriate 
20 incubation, the cells containing the GUS gene turn blue. Another screenable gene is a 
transcriptional activator for anthocyanin biosynthesis, as described in the copending 
application of Bowen et al., U.S. patent application serial No. 387,739, filed 
August 1, 1989. This gene causes the synthesis of the pigment anthocyanin. Cells 
transformed with a plasmid containing this gene turn red. Preferably, the plasmid will 
25 contain both selectable and screenable marker genes. 

The plasmid containing one or more of these genes is introduced into either 
maize protoplasts or callus cells by any of the previously mentioned techniques. If the 
marker gene is a selectable gene, only those cells that have incorporated the DNA 
package survive under selection with the appropriate phytotoxic agent. Once the 
30 appropriate cells are identified and propagated, plants are regenerated. Progeny from 
the transformed plants must be tested to insure that the DNA package has been 
successfully integrated into the plant genome. 

Mammalian host cells may also be used in the practice of the invention. 
Examples of suitable mammalian cell lines include monkey kidney CVI line 
35 transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line 293 S 
(Graham et al., J. Gen. Virol. 36:59 [1977]); baby hamster kidney cells (BHK, 
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ATCC CCL 10); Chinese hamster ovary cells (Urlab and Chasin, Proc. Natl. Acad. 
Sci USA, 77:4216 [1980]); mouse Sertoli cells (TM4, Mather, Biol Reprod, 23:243 
[1980]); monkey kidney cells (CVI-76, ATCC CCL 70); African green monkey 
kidney cells (VERO-76, ATCC CRL-1587); human cervical carcinoma cells (HELA, 
5 ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells 
(BRL 3A, ATCC CRL 1442 ; human lung cells (W138, ATCC CCL 75); human liver 
cells (Hep G2, HB 8065); mouse mammary tumor cells (MMT 060562, 
ATCC CCL 51); rat hepatoma cells (HTC, MI. 54, Baumann et al., J. Cell Biol, 85:1 
[1980]); and TRJ cells (Mather et al., Annals NY. Acad. Sci., 383:44 [1982]). 
10 Expression vectors for these cells ordinarily include (if necessary) DNA sequences for 
an origin of replication, a promoter located in front of the gene to be expressed, a 
ribosome binding site, an RNA splice site, a polyadenylation site, and a transcription 
terminator site. 

Promoters used in mammalian expression vectors are often of viral origin. 

15 These viral promoters are commonly derived from polyoma virus, Adenovirus2, and 
most frequently Simian Virus 40 (SV40). The SV40 virus contains two promoters 
that are termed the early and late promoters. These promoters are particularly useful 
because they are both easily obtained from the virus as one DNA fragment that also 
contains the viral origin of replication (Fiers et al., Nature, 273:113 [1978]). Smaller 

20 or larger SV40 DNA fragments may also used, provided they contain the 
approximately 250-bp sequence extending from the Hindlll site toward the Bgll site 
located in the viral origin of replication. 

Alternatively, promoters that are naturally associated with the foreign gene 
(homologous promoters) may be used provided that they are compatible with the host 

25 cell line selected for transformation. 

An origin of replication may be obtained from an exogenous source, such as 
SV40 or other virus (e.g., Polyoma, Adeno, VSV, BPV) and inserted into the cloning 
vector. Alternatively, the origin of replication may be provided by the host cell 
chromosomal replication mechanism. If the vector containing the foreign gene is 

30 integrated into the host cell chromosome, the latter is often sufficient. 

Satisfactory amounts of limonene synthase are produced by transformed cell 
cultures. However, the use of a secondary DNA coding sequence can enhance 
production levels. The secondary coding sequence typically comprises the enzyme 
dihydrofolate reductase (DHFR). The wild-type form of DHFR is normally inhibited 

35 by the chemical methotrexate (MTX). The level of DHFR expression in a cell will 
vary depending on the amount of MTX added to the cultured host cells. An 
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additional feature of DHFR that makes it particularly useful as a secondary sequence 
is that it can be used as a selection marker to identify transformed cells. Two forms of 
DHFR are available for use as secondary sequences, wild-type DHFR and MTX- 
resistant DHFR. The type of DHFR used in a particular host cell depends on whether 
5 the host cell is DHFR deficient (such that it either produces very low levels of DHFR 
endogenously, or it does not produce functional DHFR at all). DHFR-deficient cell 
lines such as the CHO cell line described by Urlaub and Chasm, supra, are 
transformed with wild-type DHFR coding sequences. After transformation, these 
DHFR-deficient cell lines express functional DHFR and are capable of growing in a 

10 culture medium lacking the nutrients hypoxanthine, glycine and thymidine. 
Nontransformed cells will not survive in this medium. 

The MTX-resistant form of DHFR can be used as a means of selecting for 
transformed host cells in those host cells that endogenously produce normal amounts 
of functional DHFR that is MTX sensitive. The CHO-K1 cell line (ATCC number 

15 CL 61) possesses these characteristics, and is thus a useful cell line for this purpose. 
The addition of MTX to the cell culture medium will permit only those cells 
transformed with the DNA encoding the MTX-resistant DHFR to grow. The 
nontransformed cells will be unable to survive in this medium. 

Many eukaryotic proteins normally secreted from the cell contain an 

20 endogenous signal sequence as part of the amino acid sequence. This sequence 
targets the protein for export from the cell via the endoplasmic reticulum and Golgi 
apparatus. The signal sequence is typically located at the amino terminus of the 
protein, and ranges in length from about 13 to about 36 amino acids. Although the 
actual sequence varies among proteins, all known eukaryotic signal sequences contain 

25 at least one positively charged residue and a highly hydrophobic stretch of 10-15 
amino acids (usually rich in the amino acids leucine, isoleucine, alanine, valine and 
phenylalanine) near the center of the signal sequence. The signal sequence is normally 
absent from the secreted form of the protein, as it is cleaved by a signal peptidase 
located on the endoplasmic reticulum during translocation of the protein into the 

30 endoplasmic reticulum. The protein with its signal sequence still attached is often 
referred to as the 'pre-protein' or the immature form of the protein. 

However, not all secreted proteins contain an amino terminal signal sequence 
that is cleaved. Some proteins, such as ovalbumin, contain a signal sequence that is 
located on an internal region of the protein. This sequence is not normally cleaved 

3 5 during translocation. 
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Proteins normally found in the cytoplasm can be targeted for secretion by 
linking a signal sequence to the protein. This is readily accomplished by ligating DNA 
encoding a signal sequence to the 5' end of the DNA encoding the protein and then 
expressing this fusion protein in an appropriate host cell. The DNA encoding the 
5 signal sequence may be obtained as a restriction fragment from any gene encoding a 
protein with a signal sequence. Thus, prokaryotic, yeast, and eukaryotic signal 
sequences may be used herein, depending on the type of host cell utilized to practice 
the invention. The DNA encoding the signal sequence portion of the gene is excised 
using appropriate restriction endonucleases and then ligated to the DNA encoding the 

10 protein to be secreted, i.e. limonene synthase. 

Selection of a functional signal sequence requires that the signal sequence is 
recognized by the host cell signal peptidase such that cleavage of that signal sequence 
and secretion of the protein will occur. The DNA and amino acid sequence encoding 
the signal sequence portion of several eukaryotic genes including, for example, human 

15 growth hormone, proinsulin, and prealbumin are known (see Stryer, Biochemistry^ 
W.H. Freeman and Company, New York, NY, p. 769, [1988]) and can be used as 
signal sequences in appropriate eukaryotic host cells. 

Yeast signal sequences, as for example acid phosphatase (Arima et al., Nuc. 
Acids Res., 11:1657 [1983]), alpha-factor, alkaline phosphatase and invertase may be 

20 used to direct secretion from yeast host cells. Prokaryotic signal sequences from 
genes encoding, for example, LamB or OmpF (WongetaL, Gene 68:193 [1988]), 
MalE, PhoA, or beta-lactamase, as well as other genes, may be used to target proteins 
from prokaryotic cells into the culture medium. 

As described above, the limonene synthase amino terminal transit peptide 

25 resides at SEQIDNoll, residues 1 through 89, and in the embodiment shown in 
SEQIDNo:ll directs the enzyme to plastids for processing to the mature protein. 
Alternative transit sequences from plants, animals and microbes can be employed in 
the practice of the invention to direct the gene product to the cytoplasm, endothelial 
reticulum, mitochondria or other cellular components, or to target the protein for 

30 export to the medium. These considerations apply to the overexpression of limonene 
synthase or limonene, and to direction of expression within an intact organism to 
permit gene product function in any desired location. 

An alternative technique to provide a protein of interest with a signal sequence 
such that it may be secreted is to chemically synthesize the DNA encoding the signal 

35 sequence. In this method, both strands of an oligonucleotide encoding the selected 
signal sequence are chemically synthesized and then annealed to each other to form a 
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duplex. The double-stranded oligonucleotide is then ligated to the 5* end of the DNA 

encoding the protein. 

The construct containing the DNA encoding the protein with the signal 
sequence ligated to it can then be ligated into a suitable expression vector. This 
5 expression vector is transformed into an appropriate host cell and the protein of 

interest is expressed and secreted. 

Transformation of cultured plant host cells is normally accomplished through 
Agrobacterium tumifaciens, as described above. Cultures of mammalian host cells 
and other host cells that do not have rigid cell membrane barriers are usually 

10 transformed using the calcium phosphate method as originally described by Graham 
and VanderEb (Virology, 52:546 [1978]) and modified as described in 
sections 16.32-16.37 of Sambrook et al. supra. However, other methods for 
introducing DNA into cells such as Polybrene (Kawai and Nishizawa, Mol Cell 
Biol, 4:1172 [1984]), protoplast fusion (Schaffiier, Proc. Natl Acad. Sci. USA, 

15 77:2163 [1980]), electroporation (Neumann et al., EMBOJ., 1:841 [1982]), and 
direct microinjection into nuclei (Capecchi, Cell, 22:479 [1980]) may also be used. 

Yeast host cells are generally transformed using the polyethylene glycol 
method, as described by Hinnen (Proc. Natl Acad Sci. U.S.A., 75:1929 [1978]). 

Prokaryotic host cells or other host cells with rigid cell walls are preferably 

20 transformed using the calcium chloride method as described in section 1.82 of 
Sambrook et al., supra. Alternatively, electroporation may be used for transformation 
of these cells. 

Construction of suitable vectors containing DNA encoding replication 
sequences, regulatory sequences, phenotypic selection genes and the limonene 
25 synthase DNA of interest are prepared using standard recombinant DNA procedures. 
Isolated plasmids and DNA fragments are cleaved, tailored, and ligated together in a 
specific order to generate the desired vectors. 

The DNA is cleaved using the appropriate restriction enzyme or enzymes in a 
suitable buffer. (Appropriate buffers, DNA concentrations, and incubation times and 
30 temperatures are specified by the manufacturers of the restriction enzymes.) After 
incubation, the enzymes and other contaminants are removed by extraction of the 
digestion solution with a mixture of phenol and chloroform, and the DNA is 
recovered from the aqueous fraction by precipitation with ethanol. 

To ligate the DNA fragments together to form a functional vector, the ends of 
35 the DNA fragments must be compatible with each other. In some cases the ends will 
be directly compatible after endonuclease digestion. However, it may be necessary to 
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first convert the sticky ends, commonly produced by endonuclease digestion, to blunt 
ends to make them compatible for ligation. It is then purified by phenolchloroform 
extraction and ethanol precipitation. 

The cleaved DNA fragments may be size-separated and selected using DNA 
gel electrophoresis. The DNA may be electrophoresed through either an agarose or a 
polyacrylamide matrix The selection of the matrix will depend on the size of the 
DNA fragments to be separated. After electrophoresis, the DNA is extracted from 
the matrix by electrocution, or, if low-melting agarose has been used as the matrix, by 
melting the agarose and extracting the DNA from it, as described in 
sections 6.30-6.33 of Sambrook et al., supra. 

The DNA fragments that are to be ligated together (previously digested with 
the appropriate restriction enzymes such that the ends of each fragment to be ligated 
are compatible) are present in solution in about equimolar amounts. The solution will 
also contain ATP, ligase buffer and a ligase such as T4 DNA ligase. If the DNA 
fragment is to be ligated into a vector, the vector is first linearized by cutting with the 
appropriate restriction endonuclease(s) and then phosphatased with either bacterial 
alkaline phosphatase or calf intestinal alkaline phosphatase. This prevents self-ligation 
of the vector during the ligation step. 

After ligation, the vector with the foreign gene now inserted is transformed 
into a suitable host cell, most commonly a prokaryote such as E. coli K12 strain 294 
(ATCC number 31,446) or another suitable E. coli strain. The transformed cells are 
selected by growth on an antibiotic, commonly kanamycin (kan), tetracycline (tet) or 
ampicillin (amp), to which they are rendered resistant due to the presence of kan, tet 
and/or amp resistance genes on the vector. If the ligation mixture has been 
transformed into a eukaryotic mammalian host cell, transformed cells may be selected 
by the DHFR/MTX system described above. The transformed cells are grown in 
culture and the plasmid DNA (plasmid refers to the vector ligated to the foreign gene 
of interest) is then isolated. This plasmid DNA is then analyzed by restriction 
mapping and/or DNA sequencing. DNA sequencing is generally performed by either 
the method of Messing et al., Nucleic Acids Res., 9:309 (1981) or by the method of 
Maxam et al., Methods of Enzymology, 65:499 (1980). 

After mammalian host cells have been stably transformed with the DNA, the 
DHFR-protein-coding sequences are amplified by growing the host cell cultures in the 
presence of methotrexate. The effective range of concentrations of MTX is highly 
dependent upon the nature of the DHFR gene and protein and the characteristics of 
the host. Clearly, generally defined upper and lower limits cannot be ascertained. 
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As discussed above, Hmonene synthase variants are preferably produced by 

means of mutates) that are generated using the method of site-specific mutageneses. 
This method requires the synthesis and use of specific oligonucleotides that encode 
both the sequence of the desired mutation and a sufficient number of adjacent 
nucleotides to allow the oligonucleotide to stably hybridize to the DNA template. 

The forgoing may be more fully understood in connection with the following 
representative examples, in which "Plasmids" are designated by a lower case p 
followed by an alphanumeric designate. The starting plasmids used in this invention 
are erther commercially available, publicly available on an unrestricted basis, or can be 
constructed from such available plasmids using published procedures. In addition, 
other equivalent plasmids are known in the art and will be apparent to the ordinary 

15 artisan. . 

"Dieestion", "cutting" or "cleaving" of DNA refers to catalytic cleavage of the 

DNA within enzyme that acts only at particular locations in the DNA. These 

enzymes are called restriction endonucleases, and the site along the DNA sequence 

where each enzyme cleaves is called a restriction site. The restriction enzymes used in 

20 this invention are commercially available and are used according to the instructions 

supplied by the manufacturers. Restriction enzymes are designated by abbreviations 

composed of a capital letter followed by two or three lower case letters representing 

the microorganism from which each restriction enzyme was obtained. These letters 

are followed by one or more Roman numerals that identify the particular enzyme. In 

general about 1 ug of plasmid or DNA fragment is used with about 2 units of enzyme 

in about 20 nl of buffer solution. The appropriate buffer, substrate concentration, 

incubation temperature, and incubation time for each enzyme is specified by the 

manufacturer. After incubation, the enzyme and other contaminants are removed 

from the DNA by extraction with a solution of phenol-chloroform, and the digested 

DNA is recovered from the aqueous fraction by precipitation with ethanol. Digestion 

with a restriction enzyme may be followed by treatment with bacterial alkaline 

phosphatase or calf intestinal alkaline phosphatase. This prevents the two restriction 

cleaved ends of a DNA fragment from "circularizing" or forming a closed loop that 

would impede insertion of another DNA fragment at the restriction site. Unless 

otherwise stated, digestion of plasmids is not followed by 5' terminal 



25 
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dephosphorylation. These procedures and reagents for dephosphorylation are 
described in sections 1.60-1.61 and sections 3.38-3.39 of Sambrook et al., supra. 

"Recovery" or "isolation" of a given fragment of DNA from a restriction 
digest means separation of the resulting DNA fragment on a polyacrylamide or an 
5 agarose gel by electrophoresis, identification of the fragment of interest by 
comparison of its mobility versus that of marker DNA fragments of known molecular 
weight, removal of the gel section containing the desired fragment, and separation of 
the gel from: DNA This procedure is known generally. For example, see Lawn et al. 
{Nucleic Acids Res 9:6103-6114 [1982]), and Goeddel et al. {Nucleic Acids Res, 
10 supra) 

"Southern Analysis" is a method by which the presence of DNA sequences in a 
digest or DNA-containing composition is confirmed by hybridization to a known, 
labeled oligonucleotide or DNA fragment. Southern analysis refers to the separation 
of digested DNA on an agarose gel, denaturation of the DNA, and transfer of the 

15 DNA from the gel to a nitrocellulose or nylon membrane using methods originally 
described by Southern {J.MoL Biol, 98:503 [1975]) and modified as described in 
sections 9.31-9.57 of Sambrook et al., supra. 

"Transformation" means introducing DNA into an organism so that the DNA 
is replicable, either as an extrachromosomal element or chromosomal integrant. The 

20 method used for transformation depends on whether the host cell is a eukaryote or a 
prokaryote. The method used to transform prokaryotes is the calcium chloride 
method as described in section 1.82 of Sambrook et al., supra. Eukaryotes are 
transformed using the calcium phosphate method as described in sections 16.32-16.37 
of Sambrook et al., supra. 

25 "Ligation" refers to the process of forming phosphodiester bonds between two 

double stranded DNA fragments using the enzyme ligase in a suitable buffer that also 
contains ATP. 

"Oligonucleotide" refers to short length single or double stranded sequences of 
deoxyribonucleotides linked via phosphodiester bonds. The oligonucleotides are 
30 chemically synthesized by known methods and purified on polyacrylamide gels. 

The following examples merely illustrate the best mode now contemplated for 
practicing the invention, but should not be construed to limit the invention. All 
literature citations herein are expressly incorporated by reference. 
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EXAMPLES 

Example 1 

Plant Material and Limonene Synthase Isolation 
Plant materials - Spearmint (Mentha spicata L.J plants were propagated from 
5 rhizomes or stem cuttings in peat moss:pumice:sand (58:35:10, v/v/v) and were 
grown in a greenhouse with supplemental lighting (16 h, 21,000 lux minimum) and a 
30°/15°C (day/night) temperature cycle. Plants were watered as needed and fertilized 
daily with a complete fertilizer (N:P:K, 20:20:20) plus iron chelate and 
micronutrients. Apical buds and newly emerged, rapidly-expanding leaves (5-10 mm 
10 long) of vegetative stems (3-7 weeks old) were used for nucleic acid isolation and for 
the preparation of glandular trichome cells for enzyme extraction. [l- 3 H]Geranyl 
pyrophosphate (120 Cl/mol) was prepared by NaB 3 H4 (DuPont/NEN) reduction of 
geranial, followed by pyrophosphorylation and separation of the resulting products 
according to the procedure of Croteau and Karp, Arch. Biochem. Biophys. 
15 176:734-746 [1976], Final purification was achieved by anion-exchange 
chromatography of a DEAE-cellulose column eluted with a linear gradient 
(10-80 mM) of (NH 4 ) 2 C0 3 , pH 9.0. The product gave a single band (Rf=0.4) by 
TLC on buffered silica gel G (100 mM (NH 4 ) 2 S0 4 , pH 9.0) developed with n- 
propanol:conc. NH 4 OH:H 2 0 (6:3:1, v/v/v). 
20 Limonene synthase isolation - Limonene synthase was extracted from a 

purified preparation of glandular trichome secretory cell clusters. To obtain these 
clusters, plant material was soaked in ice-cold, distilled water for 1 h and gently 
abraded in a cell disrupter (Bead-Beater, Bio-Spec Products, Bartlesville, OK). 
Batches of 20 g of plant material were abraded in the 300 ml polycarbonate chamber 
25 of the cell disrupter with 130 g glass beads (0.5 mm diameter, Bio-Spec Products), 
20 g Amberlite XAD-4 resin and buffer (25 mM Hepes, pH 7.3, containing 200 mM 
sorbitol, 0.8% (w/v) methyl cellulose, 10 mM sucrose, 10 mM KC1, 5 mM MgCl 2 , 
0.5 mM potassium phosphate and 1% (w/v) polyvinylpyrrolidone, M r = 40,000) 
added to full volume. Larger batches of plant material were abraded in 600 ml 
30 chambers of our own manufacture that had relative proportions similar to the 300 ml 
chamber. Removal of glandular trichome secretory cells was accomplished by three 
1 min pulses of operation with the rotor speed controlled by a rheostat set at 90 V. 
This procedure was carried out at 4 D C, and after each pulse the chamber was allowed 
to cool for 1 min The isolated secretory cell clusters were separated from the glass 
35 beads, XAD-4 resin and residual plant material by sieving through a series of nylon 
meshes. The secretory cell clusters (approximately 60 urn in diameter) readily 
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passed through meshes of 350 and 105 [im and were collected on a mesh of 20 \±m. 
After filtration, cell clusters were suspended in the cell disruption buffer and 
allowed to settle for 30 min at 4°C, after which the upper few ml of buffer, 
containing non-glandular trichomes and other low-density matter were discarded. 
5 Suspensions of isolated secretory cell clusters (3 to 5 x 10 5 clusters/ml) 

were disrupted by sonication with a microprobe (Braun-Sonic 2000) at maximum 
power for two 1-min bursts. Sonication was carried out in a buffer of 25 mM 
potassium phosphate, pH 6.0, containing 25% (v/v) XAD-4 resin. After 
sonication, the resulting extract was filtered through 20 urn nylon mesh, and the 
10 filtrate centrifuged at 16,000 g for 15 min (pellet discarded) and again at 195,000 g 
for 90 min to provide the soluble supernatant used as the enzyme source. This 
crude enzyme solution could be stored at -20°C for two months without significant 
loss of activity. 

Example 2 

1 5 Limonene synthase purification 

All procedures were carried out at 0-4°C. The 195,000 g supernatant from 
Example 1 was dialyzed against 25 mM Mes buffer, pH6.1, containing 10% (w/v) 
glycerol and 1 mM dithiothreitol, and, after being adjusted to 20 mM MgCl2 ? was 
desalted (Econo-Pac 10DG desalting column, Bio-Rad) into 30 mM Mes buffer, 

20 pH 6.1, containing 10% (v/v) glycerol, 100 mM KC1 and 0.5 mM dithiothreitol, and 
then applied to a DEAE-Sepharose Fast Flow anion-exchange column (10 x 100 mm; 
Pharmacia FPLC System). Proteins were eluted with a 24 ml linear gradient from 100 
to 500 mM KC1 in the loading buffer. The limonene synthase activity eluted as a 
single peak between 300-340 mM KCI. The eluate was subjected to dye-ligand 

25 chromatography on a 3 ml column of Matrex Gel Red A (3 mg dye/ml, Amicon) 
pre-equilibrated with the same buffer containing 20 MgCl 2 . The column was washed 
with 50 ml of the equilibration buffer to remove unbound proteins, and the limonene 
synthase activity was then eluted with 20 ml of 100 mM potassium phosphate, pH 7.6, 
containing 10% (v/v) glycerol and 0.5 mM dithiothreitol. The purity of the protein 

30 was judged to be -99% by SDS-PAGE (Laemmli, Nature 227:680-685 [1970]) with 
silver staining (Wray et al., Anal. Biochem. 118:197-203 [1981]). 

The eluate from anion-exchange chromatography was mixed with an equal 
volume of 100 mM potassium phosphate buffer, pH 7.0, containing 2 M (NH 4 ) 2 S04, 
5% (v/v) glycerol and 0.5 mM dithiothreitol, and subjected to hydrophobic interaction 

35 chromatography (HIC) on an HRLC MP7 HIC column (Cj ligand, Bio-Rad), pre- 
equilibrated with 100 mM potassium phosphate buffer, pH 7.0, containing 1M 
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(NH^SO^ 5% (w/v) glycerol and 0.5 mM dithiothreitol The column was 
developed with a 10 ml reversed gradient from 1.0 to 0 0 M (NH 4 ) 2 S0 4 (1ml 
fractions, 1 ml/min) in the same buffer. The purified limonene synthase activity eluted 
in a single peak (2 ml) between 300-100 mM (NH 4 ) 2 S0 4 . 

To obtain sufficient quantities of enzyme product for characterization, 
limonene synthase was partially purified by subjecting the 195,000 g supernatant to 
anion-exchange chromatography on 20 g of DEAE-cellulose (DE-52, Whatman). 
The column was equilibrated with 15 mM sodium phosphate buffer, pH6.0, 
containing 1 mM sodium ascorbate, 0.5 mM dithiothreitol and 0.2 mM EDTA. Ater 
the 195,000 g supernatant was loaded, the column was washed with 100 ml 
equilibration buffer containing 100 mM NaCl, and the limonene synthase activity then 
eluted with 100 ml of the same buffer containing 250 mM NaCI. Following anion- 
exchange chromatography on DEAE-cellulose, the limonene synthase appeared to 
represent 40% of the total protein as judged by SDS-PAGE (data not shown). 

Gel electrophoresis - SDS-PAGE was carried out according to Laemmli 
{Nature, supra) in 10% polyacrylamide vertical slab gels (16cmx 18 cmx 1 mm, 
Hoefer SE600 vertical gel apparatus). Aliquots of the purified limonene synthase 
from above (-50 jag) were dialyzed against water, lyophilized, resuspended in SDS 
sample buffer (Laemmli et al., Nature, supra) and denatured on a steam bath for 5 min 
before loading. Gels were run at a constant current of 20 mA for about 3 h, and were 
stained with 0.25% (w/v) Coomassie Brilliant Blue R-250 in methanol: acetic 
acid:H 2 0 (30:10:60) and destained in methanol: acetic acid:H 2 0 (35:10:55). Some 
gels were also silver stained (Wrayetal., Anal. Biochem. 118:197-203 [1981]). 
Molecular weight markers were obtained from Diversified Biotech. 

For nondenaturing PAGE, a 10% polyacrylamide vertical slab gel 
(16 cm x 18 cm x 1 mm) employing a discontinuous buffer system (Jouin, 
Biochemistry 12:871-879 [1973]) without SDS was cast and subjected to pre- 
electrophoresis for 30 min at 20 mA in buffer containing 1 mM sodium thioglycolate 
to remove residual persulfate used in polymerization (Moos et al., J. Biol. Chem. 
263:6005-6008 [1988]). Protein samples were dialyzed against 62 mM Tris-HCl 
buffer, pH 6.6, containing 5 mM dithiothreitol and 10% (w/v) glycerol, and aliquots 
containing 100-300 jig protein were loaded into 1 cm-wide wells and electrophoresed 
at a constant voltage of 25 V for 20 h at 4°C. Limonene synthase activity was 
localized by assay of gel slices (0.8 cm) in 1.0 ml of assay buffer (see below), this 
activity migrated with an R m = 0.82. 
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Example 3 
Limonene Synthase Activity Assay 
The reaction was initiated by addition of 10 |jM (l- 3 H)geranyl pyrophosphate. 
Pentane (1 ml) was carefully layered on top of the aqueous assay mixture to trap 
5 volatile products. After incubation for 1 h at 30°C with gentle agitation, the reaction 
was stopped by chilling and vigorous mixing, NaCl was added to saturation, and the 
assay tube was centrifuged to separate the phases. The pentane layer was removed, 
the aqueous phase reextracted with an additional 1 ml of pentane, and the combined 
pentane extract was passed over a short column of silica gel (SilicAR 60 

10 A. Mallinckrodt) overlaid with MgS0 4 in a Pasteur pipet This procedure affords an 
extract containing only hydrocarbons, such as limonene, and is free of oxygenated 
products which remain adsorbed to the silica gel. When an evaluation of the full 
range of metabolites produced in the assay was required (i.e., including oxygenated 
products, such as geraniol, liberated from the substrate by endogenous 

15 phosphohydrolases), the remaining aqueous phase was also extracted with diethyl 
ether and the ether passed through the original silica gel column to recover the 
oxygenated products. Radioactivity in the pentane and diethyl ether eluates was 
determined by liquid scintillation counting in 10 ml of cocktail containing 0.4% (w/v) 
Omnifluor (Du Pont) dissolved in 30% ethanol in toluene, using a Packard Tricarb 

20 460 CD liquid scintillation spectrometer ( 3 H efficiency = 42%). For product 
identification by GLC or radio-GLC, aliquots of the assay extracts were diluted with 
the appropriate internal standards and concentrated under a stream of N 2 . Preparative 
incubations, used to generate product for subsequent characterization, were carried 
out in a slightly modified assay buffer of 10 mM Mopso-5 mM sodium phosphate, 

25 pH7.0, containing 10% (w/v) glycerol, 1 mM sodium ascorbate, 1 mM MnCl 2 and 
0.5 mM dithiothreitol. Protein concentrations were estimated by the dye-binding 
technique of Bradford (Anal Biochem. 72:248-254 [1976]) using the Bio-Rad 
protein assay kit with lysozyme as standard. Both enzyme activity measurements and 
protein determinations were reproducible to within 10%. 

30 Example 4 

Amino acid analysis and protein sequencing 
Approximately 25 [ig of purified limonene synthase was subjected to acid 
hydrolysis and subsequent amino acid analysis at the Washington State University 
Bioanalytical Center. In preparation for proteolysis and peptide sequencing, 

35 approximately 75 [xg of purified protein was subjected to SDS-PAGE according to 
the method of Schagger and von Jagow (Schagger and von Jagow, Anal Biochem. 



WO 95/1 1913 



-36- 



PCT/LS94/ 12379 



166:368-379 [1987]) in 10% polyacrylamide venical slab gels 
(16 cm x 18 cm x 0.7 mm). Gels were stained with Coomassie Brilliant Blue R-250 in 
methanol: acetic acid:water (30:10:60, v/v/v) and destained (in methanol: acetic 
acid water (10:10:80 v/v/v) and the gel bands containing the limonene synthase (at 
5 56 kD) were excised, washed with water, incubated in SDS sample buffer (Laemmli, 
supra) for 30 min at 30°C, and then inserted into the sample wells of a 16.5% 
polyacrylamide vertical slab gel (16 cm x 18 cmx 1.0 mm) for SDS-PAGE as before 
(Schagger and von Jagow, supra). Each gel slice was overlaid with buffer consisting 
of 0.125 M Tris (pH 6.8), 1 mM EDTA, 2.5 mM dithiothreitol and 0.1% (v/v) SDS 

10 and containing 2 jig V8 protease (Sigma) according to the standard practice 
(Cleveland et al., J. Biol Chem, 252:1102-1106 [1977]). Samples were 
electrophoresed until the bromophenol blue dye had traveled about 1 cm to 
concentrate the protein into the stacking gel, and the power was turned off for 50 min 
to permit proteolytic digestion. Electrophoresis was then continued for 20 h at a 

1 5 constant voltage of 90v. 

For sequence analysis, polyvinylidenedifluoride membranes (Immobilon-P s Q, 
Millipore) were first wetted in methanol and equilibrated in 25 mM Tris-190mM 
glycine (pH 8.3) containing 20% (v/v) methanol and then gel blotted in the same 
buffer (Towbinetal, Proa Natl. Acad Sci. USA 76:4350-4354 [1979]). 

20 Electroblotting was carried out for 90 min at a constant current of 100 mA using a 
Hoefer TE 70 semi-dry transfer apparatus. Peptides were sequenced via Edman 
degradation on an Applied Biosystems 470 sequenator at the Washington State 
University Laboratory for Bioanalysis and Biotechnology. 

For CNBr cleavage, approximately 50 jag of purified limonene synthase was 

25 lyophilized in a 1.5 ml microfuge tube and dissolved in 200 pal of a degassed solution 
containing 10 p.g/fj.1 CNBr in 70% aqueous formic acid. After thorough mixing and 
flushing with Ar, the mixture was incubated in the dark for 20 h at 20°C. The sample 
was dried and excess CNBr and formic acid removed under vacuum (Savant Speed 
Vac), and the wall of the tube was washed down with 200 pi 0.001 N nH 4 OH and the 

30 sample redried under vacuum The sample was then resuspended in SDS buffer 
(Laemmli, supra) and subjected to SDS-PAGE as before (Schagger and von Jagow, 
supra) in 16.5% polyacrylamide vertical slab gels (16 cm x 18 cm x 1.0 mm), 
electroblotted to polyvinylidenedifluoride (Immobilon P S Q), and the peptides 
subjected to Edman degradative sequencing as described above for peptides generated 

35 by V8 proteolysis. Alternatively, the CNBr-generated peptides were dissolved in 10% 
aqueous acetonitrile with 0.1% (v/v) trifluoroacetic acid (starting solvent) and 
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separated by reversed phase HPLC (Rainin, C4-Dynamax 3 00 A column) with a linear 
gradient from starting solvent to 80% aqueous acetonitrile with 0.07% trifluoroacetic 
acid. Purified peptides in the lyophilized chromatographic fractions were sequenced 
as above. 

5 Example 5 

Plasmid Formation and Screening 
Isolation of nucleic acids - Newly emerging spearmint leaves with a very high 
density of developing epidermal oil glands were used as starting material. Total RNA 
(yield —1 mg/g fr. tissue wt.) was extracted using the procedures of Cathala and 

10 associates (Cathala et aL, DNA 2:329-335 [1983]), with the substitution of 300 mM 
for 50 mM Tris in the tissue homogenization buffer. Poly(A) + RNA was purified by 
chromatography on an oligo(dT)-cellulose column (Pharmacia). Plasrnid DNA was 
amplified in Escherichia coli strain XLl-Blue and was prepared by standard 
procedures (Sambrook et al., supra). 

15 Library construction and screening - A spearmint leaf cDNA library was 

constructed from 5 |ig poly(A) + mRNA using the ZAP-cDNA synthesis kit with 
lambda UniZap XR vector, and was packaged using the Gigapack II Packaging 
Extract according to the manufacturer's instructions (Stratagene). Using the sequence 
information obtained from peptides generated by CNBr cleavage and V8 proteolysis 

20 of limonene synthase, three degenerate oligonucleotide probes were synthesized (at 
the Washington State University Laboratory for Bioanalysis and Biotechnology) (see 
FIGURE 2) and end labeled with 32 P using T4 kinase (Bethesda Research 
Laboratories) by standard procedures (Titus, Promega Protocols and Applications 
Guide, 2nd Ed., Promega Corp., Madison, WI, [1991]). The three probes were used 

25 to screen replicate filter lifts of 2.5 x 10 5 primary plaques grown in E. coli PLK-F' 
using the Stratagene protocols. The hybridization conditions were modified from the 
NEN procedure using 6xSSC (1 x SSC = 0.15 MNaCl in 0.015 M sodium citrate, 
pH 7.0) containing 1% SDS, 10% dextran sulfate, and 50 ug/ml each of salmon 
sperm and E. coli DNA at 47°C for 20 h. Blots were washed twice in 6 x SSC for 

30 5 min at 47°C, thrice in 3 x SSC for 30 min, dried in a vacuum oven and, as in all 
instances described here where 32 P-labeled probes were used, autoradiograms were 
prepared with Kodak XAR-5 film exposed overnight at -80°C Plaques affording 
positive signals with each of the three probes (a total of 20 were picked) were 
rescreened on E. coli XLl-Blue through four cycles using each probe individually 

35 until clones pLC 4.1, pLC 5.2, pLC 8.5 and pLC 10.1, that hybridized to all three 
probes, were pure. The selected lambda UniZap XR clones were in vivo excised and 
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recircularized as Bluescript II SK(+) phagemids that were then employed to infect 
E. coli XLl-Blue using procedures provided by Stratagene 

Example 6 
cDNA Expression 

5 Expression of limonene synthase cDNA - E, coli XLl-Blue cells containing 

Bluescript II SK(+)> pLC4.1, pLC 5.2, pLC 8.5 or pLClO.l were grown to 
stationary phase in 5 mis Luria-Bertani medium with 100 ng/ml ampicillin and used to 
inoculate 100 ml liquid cultures either without or with 50 ^ig/ml antibiotic which were 
incubated at 37°C with shaking at 350 rpm. When the cultures reached A^q = 0.5, 

10 either 1 mM or 10 mM EPTG was added to induce the cultures which were incubated 
for an additional 2 h. Cells were collected by centrifugation at 2400 g for 10 min, 
resuspended in 20 mM Tris (pH 7.5) and centrifuged again, then suspended in 2.5 mL 
buffer containing 20 mM Mopso (pH 7.0), 1 mM EDTA, 1 mM dithiothreitol and 
10% (v/v) glycerol and disrupted by sonication (Braun-Sonic 2000 with microprobe 

15 at maximum power for four 1-min bursts at 0-4°C). The bacterial homogenates were 
then clarified by centrifugation at 13,000 g (the pellets contained negligible limonene 
cyclase activity), the resulting supernatants dialyzed to standard assay conditions, and 
the assay for limonene cyclase activity carried out under linear conditions as described 
above (Alonso et al., X Biol Chem., supra). In some instances, portions of these 

20 preparations were taken for immunoblotting (described below) or were partially 
purified by anion exchange chromatography. In the latter case, the supernatant was 
desalted into 15mM potassium phosphate buffer (pH 6.0) containing 10% (v/v) 
glycerol, 150mM KC1 and 1 mM dithiothreitol, and then applied to a DEAE- 
Sepharose Fast Flow column (5 x 100 mm, Pharmacia FPLC system) that was eluted 

25 with a linear gradient from 150 to 600 mM KC1 in loading buffer. This limonene 
cyclase activity expressed inE. coli eluted as a single peak between 365 and 465 mM 
KC1, as with the native enzyme from Mentha. 

To examine the possible production of limonene in transformed E. coli, 
cultures harboring pLC 5.2 (showing the highest limonene synthase specific activity 

30 by in vitro assay) were grown as before to various densities corresponding to 
A500 = 0-3 to 0.8. IPTG (10 mM) was added and the induced cultures were 
incubated for an additional 1-2 h to provide enzyme activity levels ranging from 10 to 
50 nmol/h/culture; in some instances, a pentane overlay (10 ml/culture) was added to 
trap the volatile olefinic product. Following incubation, the cultures were chilled in 

35 ice, saturated with NaCl, homogenized and extracted with pentane (3x10 ml), and 
the combined pentane extract was passed over a silica gel column to provide the 
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hydrocarbon fraction, free of oxygenated metabolites. The internal standard was 
added (lOnmol /?-cymene) and the solvent concentrated under a stream of N 2 in 
preparation for analysis of the hydrocarbon fraction by capillary GLC 

Example 7 
Cloning in pET Vector 
Cloning into the pET vector requires two unique restriction sites in the target 
gene: a Ndel site overlapping the translation initiation codon and a BamHl site at the 
3' end. The latter site is easily engineered through cloning, in the correct orientation, 
into the polyiinker region of pBluescript SK II (Stratagene) in which the gene at this 
stage will residue. The Ndel site can be engineered through use of PCR. In the case 
of the (-)-limonene synthase cDNA (nascent protein), there is an additional Ndel site 
within the 3 -untranslated sequence. In order to remove this Ndel site, add an Ndel 
site at the initiation codon, and position a BamHl site at the 3* end of the clone, the 
following strategy was employed to express the limonene synthase preprotein. The 
entire coding region of the pLC 5.2 cDNA was amplified by PCR using two primers: 
one primer was positioned downstream of the coding region but upstream of the 
existing Ndel site; the other primer corresponded to sixteen nucleotides of a 
Kpnl/Ndel linker overlapping with the first twenty nucleotides of the pLC 5.2 coding 
region. The amplified product was blunted at both ends, using the Klenow fragment 
of DNA polymerase I, then digested with Kpnl in order to create a sticky end for 
directional cloning. This product was then cloned into pBluescript SK II digested 
with Kpnl and Sma\. The resulting plasmid contained the entire pLC 5.2 coding 
region with the requisite Ndel and BarriHl restriction sites. After limited sequencing 
to confirm that no errors were introduced by PCR, the pLC 5.2 coding region was 
excised with Ndel and BamHl and cloned directly into the corresponding sites of the 
His^Tag pET vector. 

The verified plasmid was used to transform the E. coli BL21(DE3)pLysS 
strain and, following induction, growth at room temperature, harvest and purification 
by metal ion affinity chromatography/thrombin proteolysis, gave -15% yield of the 
recombinant nascent limonene synthase as total cellular protein. Little proteolysis of 
the nascent synthase transit peptide was observed in E. coli BL21(DE3) strain in vivo, 
perhaps because of strain differences (BL21 is deficient in several major proteases), 
the fact that the preprotein arises as the (His) 6 -fusion protein, or that protein synthesis 
simply "out-runs" proteolysis in this system. 
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Example 8 
Limonene Synthase Analysis 

Product analysis and other analytical methods - Radio-GLC was performed 
on a Gow-Mac 5 5 OP gas chromatograph attached to a Packard 894 gas proportional 
counter (Satterwhite and Croteau, J. Chromatogr. 452:61-73 [1987]) under 
conditions designed to separate limonene from all other naturally occurring 
monoterpene olefins (Rajaonariyony et al.,Arch. Biochem. Biophys., supra). 

Capillary GLC was utilized for the analysis of limonene production by 
transformed E. coli cultures (Hewlett-Packard 5890: injector, 220°C; detector, 
300°C; split ratio, 5:1; H 2 carrier at 14 psi. column: 0.25 mm i d. x 30 m with 0.25 \x 
m film of AT- 1000 (Alltech); programmed from 35°C (5 min) to 230°C at 10°C/min). 
Based on internal standardization with /?-cymene, and the detector response, an 
amount of limonene equivalent to 0.5 nmol per culture can be easily determined by 
this method. 

Procedures for immunoblotting on nitrocellulose using polyclonal antibodies 
generated in rabbits against spearmint limonene synthase, and detection by alkaline 
phosphatase conjugated goat anti-rabbit IgG, have been described (Alonso et a!., 
Arch. Biochem. Biophys.^ supra). Cross-reacting antibodies to E. coli proteins were 
removed by subjecting the antiserum to affinity chromatography using a column of 
soluble proteins from XL-1 Blue/Bluescript II SK(+) linked to CNBr-activated 
Sepharose (Sigma) following established protocols (Harlow and Lane, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
[1988]). In some instances, horseradish peroxidase conjugated goat anti-rabbit IgG 
(1:2,000; Bio-Rad Laboratories) was substituted as the secondary antibody, and 
positives were detected by enhanced chemiluminescence on Kodak XAR-5 film after 
treating the filters with the Amersham enhancing reagents. For quantitation, blots 
were scanned at 633 nm with an LKB 2202 Ultroscan laser densitometer. In most 
instances, —100 |ig of E. coli protein was employed and was dialyzed against 20 mM 
Tris (pH 6.8) containing 1 mM dithiothreitol and 10% (v/v) glycerol, lyophilized, and 
thermally denatured in the presence of SDS and £-mercaptoethanol (Laemmli, supra) 
before use. Protein concentrations were estimated by dye-binding using the Bio-Rad 
Protein Assay, and Rainbow molecular weight markers for use in blotting were 
obtained from Amersham. 

For RNA blot analysis, spearmint and peppermint poly(A) + RNA (3 p.g each) 
were separated on a 1.5% formaldehyde agarose gel and blotted onto Genescreen 
Plus nylon membranes (NEN). Probe DNA was prepared by digesting pLC 5.2 with 
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Sma\ and Apal, separated by agarose gel electrophoresis, and electrocuted using 
standard procedures (Titus, supra). The cDNA insert was then labeled with 32 P via 
the hexamer reaction (Cathala et al, supra). Hybridization according to the NEN 
protocol was for 16 h at 42°C in 15 mis of hybridization solution consisting of 
5 5 x SSPE (1 x SSPE = 150 mM NaCl, 10 mM sodium phosphate and 1 mM EDTA), 
50% deionized formamide, 5 x Denhardts, 1% SDS, and 100j^g/ml denatured, 
sheared salmon sperm DNA. Blots were washed twice for 5 min with SSPE at room 
temperature, thrice at 65°C for 45 min with 2 x SSPE containing 2% SDS, and finally 
thrice for 15 min with 0. 1 x SSPE at room temperature. 

10 Sequencing double-stranded phagemid DNA of pLC 5.2 and the other clones 

was by chain termination method using Sequenase Version 2.0 (United States 
Biochemical). Both strands of the pLC 5 .2 cDNA were completely sequenced using 
T7 and T3 primers with additional primers synthesized as needed. DNA fragments 
were assembled and the sequence analyzed using SEQAID II. version 3.81 (a public 

15 domain program provided by the authors D.D. Rhodes and DJ. Roufa, Kansas State 
University), and the Genetics Computer Group Packet (Devereaux et al., supra). The 
sequence of the positive coding strand is set forth in SEQIDNo:8 and shown in 
FIGURE 4. 

While the preferred embodiment of the invention has been illustrated and 
20 described, it will be appreciated that various changes can be made therein without 
departing from the spirit and scope of the invention. For example, sequence 
variations from those described and claimed herein as deletions, substitutions, 
mutations, insertions and the like are intended to be within the scope of the claims 
except insofar as limited by the prior art. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Crouteau, Rodney B. 

Colby, Sheila M. 

(ii) TITLE OF INVENTION: DNA Encoding Limonene Synthase From 
Mentha Spicata 

(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: , Dce 

(A) ADDRESSEE: Christensen, O'Connor, Johnson & Kindness 

(B) STREET: 2800 Pacific First Centre, 1420 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98101-2347 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.2b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Sheiton, Dennis K. 

(B) REGISTRATION NUMBER: 26,997 

(C) REFERENCE/ DOCKET NUMBER: WSUR-1-7219 

(ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: (206) 224-0718 

(B) TELEFAX : (206) 224-0779 

(C) TELEX: 4938023 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Glu Leu Glu Lys Glu Thr Asp Gin He Arg Gin Leu Glu Leu lie Asp 

^ 10 15 



Asp Leu Gin Arg Met 

20 
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(2) INFORMATION FQR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asn Pro Val Val Leu Glu Leu Ala lie Leu Asp Leu Asn lie Val Gin 
15 10 15 

Ala Gin Phe Gin Glu Glu Leu Lys Glu Ser Phe 

20 , 25 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Gly Lys Val Asn Ala Leu lie Thr Val lie Asp Asp He Tyr Asp Val 
15 10 15 

Tyr Gly Thr Leu 

20 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GARAARGANA C RGA Y CARAT 2 0 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: N is inosine 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AAYATHGTNC ARGCNCARTT YCA 23 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
AAR GAY AT H T AYGAYGT ^ 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GAGAGAGAAA GAGAGCGGAA G GAAAGAT T A AT CAT GG CT C TCAAAGTGTT AAGTGTTGCA 60 
ACTCAAATGG CGATTCCTAG CAACCTAACG ACATGTCTTC AACCCTCACA CTTCAAATCT 12 0 

TCTCCAAAAC T GT TAT C TAG CACTAACAGT AG TAG T 156 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



AGAGAGAGAG 


AGGAAGGAAA 


GATT AAT CAT 


GGCTCTCAAA 


GTGTTAAGTG 


TTGCAACTCA 


60 


AATGGCGATT 


CCTAGCAACC 


TAACGACATG 


TCTTCAACCC 


TCACACTTCA 


AATCTTCTCC 


120 


AAAACTGTTA 


TCTAGCACTA 


ACAGTAGTAG 


TCGGTCTCGC 


CTCCGTGTGT 


ATTGCTCCTC 


180 


CTCGCAACTC 


ACTACTGAAA 


GACGATCCGG 


AAACTACAAC 


CCTTCTCGTT 


GGGATGTCAA 


240 


CTT CAT C CAA 


TCGCTTCTCA 


GTGACTATAA 


G GAG G AC AAA 


C AC GT GAT T A 


GGGCTTCTGA 


300 


GCTGGTCACT 


TTGGTGAAGA 


TGGAACTGGA 


GAAAGAAACG 


GATCAAATTC 


GACAACTTGA 


360 
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GT T GAT C GAT GACTTGCAGA GGATGGGGCT GTCCGATCAT TTCCAAAATG AGTTCAAAGA 420 

AATCTTGTCC T C TAT AT AT C TCGACCATCA CTATTACAAG AACCCTTTTC CAAAAGAAGA 4 80 

AAGGGATCTC TACTCCACAT CTCTTGCATT TAGGCTCCTC AGAGAAC AT G GTTTTCAAGT 54 0 

CGCACAAGAG G TAT T C GAT A GTTTCAAGAA CGAGGAGGGT GAGTT CAAAG A*VAG C C T TAG 600 

CGACGACACC AGAGGATTGT TGCAACTGTA TGAAGCTTCC TTTCTGTTGA CGGAAGGCGA 660 

AACCACGCTC GAG T C AG C G A GGGAATTCGC CACCAAATTT T T G GAGGAAA AAGTGAACGA 72 0 

GGGTGGTGTT GATGGCGACC TTTTAACAAG AAT C G CAT AT TCTTTGGACA TCCCTCTTCA 78 0 

TTGGAGGATT AAAAGGCCAA AT GCACCTGT GTGGATCGAA TGGTATAGGA AGAGGCCCGA 84 0 

CATGAATCCA GTAGTGTTGG AGCTTGCCAT ACTCGACTTA AAT AT T GT T C AAGCACAATT 9 00 

TCAAGAAGAG CT CAAAGAAT CCTTCAGGTG GTGGAGA^AT ACTGGGTTTG TTGAGAAGCT 9 60 

GCCCTTCGCA AG G GAT AGAC TGGTGGAATG CTACTTTTGG AAT AC T G G G A TCATCGAGCC 102 0 

ACGTCAGCAT GCAAGTGCAA G GAT AAT GAT GGGCAAAGTC AACGCTCTGA TTACGGTGAT 10 80 

C GAT GAT ATT TAT GAT GT C T ATGGCACCTT AGAAGAACTC GAACAATTCA CTGAC C T CAT 114 0 

TCGAAGATGG GATATAAACT CAATCGACCA ACTTCCCGAT TACAT GCAAC TGTGCTTTCT 12 0 0 

TGCACTCAAC AACTTCGTCG ATGATAC AT C GTACGATGTT AT GAAG GAGA AAGGCGTCAA 12 60 

CGTTATACCC TACCTGCGGC AATCGTGGGT TGATTTGGCG GATAAGTATA TGGTAGAGGC 1320 

ACGGTGGTTC TACGGCGGGC AC AAAC CAAG T T T G GAAG AG TATTTGGAGA ACTCATGGCA 138 0 

GTCGATAAGT GGGCC CTGTA TGTTAACGCA CAT AT T CT T C CGAGTAACAG ATTCGTTCAC 144 0 

AAAGGAGACC GTCGACAGTT TGTACAAATA C C AC GAT T T A GTTCGTTGGT CATCCTTCGT 15 0C 

TCTGCGGCTT GCTGATGATT TGGGAACCTC GGT GGAAGAG GT GAG CAGAG GGGATGTGCC 156C 

GAAATCACTT CAGTGCTACA T GAGT GACTA CAAT G CAT C G GAGGCGGAGG CGCGGAAGCA 162 0 

CGTGAAATGG CTGATAGCGG AGGTGTGGAA GAAGAT GAAT GCGGAGAGGG T GT C GAAG GA 168 0 

TTCTCCATTC GGCAAAGATT TTATAGGATG TGCAGTTGAT T TAG GAAG GA TGGCGCAGTT 174 0 

GAT GTAC CAT AAT G GAG AT G GGCACGGCAC ACAACACCCT AT TAT AC AT C AACAAAT GAC 1800 

CAGAACCTTA TTCGAGCCCT T T G CAT GAGA GAT GAT GAC G AGCCATCGTT TACTTACTTA 1860 

AATTCTACCA AAGTTTTTCG AAGGCATAGT TCGTAATTTT TCAAGCACCA ATAAATAAGG 192 0 

AGAATCGGCT C AAAC AAAC G TGGCATTTGC C AC C AC G T GA GCACAAGGGA GAGTCTGTCG 198 0 

TCGTTTATGG AT GAAC TAT T CAATTTTTAT G CAT G T AAT A ATTAAGTTCA AGTT CAAGAG 2 04 0 

CCTTCTGCAT ATTTAACTAT G TAT T T GAAT T TAT C GAGT G TGATTTTCTG TCTTTGGCAA 2100 

CAT AT AT T T T TGTCATATGT GGCAT CTTAT TAT GAT AT C A TACAGT GTTT AT G GAT GAT A 2160 
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T GAT AC TAT C AAAAAAAAAA AA 2182 
(2) INFORMATION FOR SEQ ID NO: 9: 

(il SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATTAATCCTA G AAAAA CAT A GAAAG AG AG C GGAAGGAAAG AT T AAT CAT G GCTCTCAAAG 60 

TGTTAAGTGT TGCAACTCAA ATGGCGATTC CTAGCAACCT AACGACATGT CTTCAACCCT 12 0 

C AC AC T T C AA ATCTTCTCCA AAACTGTTAT CTAGCACTAA CAGTAGTAGT 17 0 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TATCCAGCAC TAACAGTAGT AGTAGGTCTC GCCTCCGTGT GTATTTCTCC TCCTCGCAAC 60 

TCACTACTGA AAGACGATCC GGAAACTACA ACCCTTCTCG TTGGGATGTC AACTTCATCC 12 0 

AATCGGTTCT CAGT GACTAT AAGGAGGACA AACACGT 157 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Leu Lys Val Leu Ser Val Ala Thr Gin Met Ala lie Pro Ser 

Asn Leu Thr Thr Cys Leu Gin Pro Ser His Phe Lys Ser Ser Pro Lys 

20 25 30 

Leu Leu Ser Ser Thr Asn Ser Ser Ser Arg Ser Arg Leu Arg Val Tyr 
35 40 45 

Cys Ser Ser Ser Gin Leu Thr Thr Glu Arg Arg Ser Gly Asn Tyr Asn 
50 55 60 
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Pro Ser Arg Trp 
65 

Lys GIu Asp Lys 



Lys Met Glu Leu 

100 

lie Asp Asp Leu 
115 

Phe Lys Glu lie 
130 

Asn Pro Phe Pro 
145 

Phe Arg Leu Leu 



Asp Ser Phe Lys 

180 

Asp Thr Arg Gly 
195 

Glu Gly Glu Thr 

210 

Leu Glu Glu Lys 
225 

Arg lie Ala Tyr 



Pro Asn Ala Pro 

260 

Asn Pro Val Val 
275 

Ala Gin Phe Gin 
290 

Thr Gly Phe Val 
305 

Cys Tyr Phe Trp 



Ala Arg lie Met. 

340 

Asp lie Tyr Asp 
355 

Asp Leu lie Arg 
370 



Asp Val Asn Phe 
70 

His Val lie Arg 

85 

Glu Lys Glu Thr 



Gin Arg Met Gly 

120 

Leu Ser Ser lie 
135 

Lys Glu Glu Arg 
150 

Arg Glu His Gly 
165 

Asn Glu Glu Gly 



Leu Leu Gin Leu 

200 

Thr Leu Glu Ser 
215 

Val Asn Glu Gly 
230 

Ser Leu Asp lie 
245 

Val Trp lie Glu 



Leu Glu Leu Ala 

280 

Glu Glu Leu Lys 
295 

Glu Lys Leu Pro 
310 

Asn Thr Gly lie 

325 

Met Gly Lys Val 



Val Tyr Gly Thr 

360 

Arg Trp Asp He 
375 



He Gin Ser Leu 
75 

Ala Ser Glu Leu 
90 

Asp Gin He Arg 
105 

Leu Ser Asp His 



Tyr Leu Asp Kis 

140 

Asp Leu Tyr Ser 
155 

Phe Gin Val Ala 
170 

Glu Phe Lys Glu 
185 

Tyr Glu Ala Ser 



Ala Arg Glu Phe 

220 

Gly Val Asp Gly 
235 

Pro Leu His Trp 

250 

Trp Tyr Arg Lys 

265 

lie Leu Asp Leu 



Glu Ser Phe Arg 

300 

Phe Ala Arg Asp 
315 

lie Glu Pro Arg 
330 

Asn Ala Leu He 
345 

Leu Glu Glu Leu 



Asn Ser He Asp 

380 



Leu Ser Asp Tyr 

80 

Val Thr Leu Val 
95 

Gin Leu Glu Leu 
11C 

Phe Gin Asn Glu 
125 

His Tyr Tyr Lys 



Thr Ser Leu Ala 

160 

Gin Glu Val Phe 
175 

Ser Leu Ser Asp 

190 

Phe Leu Leu Thr 
205 

Ala Thr Lys Phe 



Asp Leu Leu Thr 

240 

Arg He Lys Arg 

255 

Arg Pro Asp Met 

270 

Asn lie Val Gin 
285 

Trp Trp Arg Asn 



Arg Leu Val Glu 

320 

Gin His Ala Ser 
335 

Thr Val He Asp 
350 

Glu Gin Phe Thr 
365 

Gin Leu Pro Asp 
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Tyr Met Gin Leu 
385 

Ser Tyr Asp Val 



Arg Gin Ser Trp 

420 

Trp Phe Tyr Gly 
435 

Ser Trp Gin Ser 
450 

Arg Val Thr Asp 
465 

Tyr His Asp Leu 



Asp Leu Gly Thr 

500 

Ser Leu Gin Cys 
515 

Arg Lys His Val 
530 

Ala Glu Arg Val 
545 

Cys Ala Val Asp 



Asp Gly His Gly 

580 



Cys Phe Leu Ala 
393 

Met Lys Glu Lys 
405 

Val Asp Leu Ala 



Gly His Lys Pro 

440 

He Ser Gly Pro 
455 

Ser Phe Thr Lys 
470 

Val Arg Trp Ser 
485 

Ser Val Glu Glu 



Tyr Met Ser Asp 

520 

Lys Trp Leu He 
535 

Ser Lys Asp Ser 
550 

Leu Gly Arg Met 
565 

Thr Gin His Pro 



Leu Asn Asr. Phe 

395 

Gly Val Asn Val 
410 

Asp Lys Tyr Met 
425 

Ser Leu Glu Glu 



Cys Met Leu Thr 

4 6C 

Glu Thr Val Asp 
475 

Ser Phe Val Leu 
490 

Val Ser Arg Gly 
505 

Tyr Asn Ala Ser 



Ala Glu Val Trp 

540 

Pro Phe Gly Lys 
555 

Ala Gin Leu Met 
570 

lie He His Gin 
585 



Val Asc Asp Thr 

400 

He Pro Tyr Leu 
415 

Val Glu Ala Arg 
430 

Tyr Leu Glu Asn 
445 

His He Phe Phe 



Ser Leu Tyr Lys 

480 

A.rg Leu Ala Asp 
495 

Asp Val Pro Lys 
510 

Glu Ala Glu Ala 
525 

Lys Lys Met Asn 



Asp Phe He Gly 

560 

Tyr His Asn Gly 
575 

Gin Met Thr Arg 
590 



Thr Leu Phe Glu Pro Phe Ala 
595 
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CLAIMS 

The embodiments of the invention in which an exclusive property or privilege is 
claimed are defined as follows: 

1 . An isolated nucleotide sequence encoding limonene synthase. 

2. A nucleotide sequence of Claim 1 encoding limonene synthase from Mentha 

spicata. 

3. An isolated nucleotide sequence encoding a protein having the biological 
activity of SEQ ID No: 1 1 . 

4. The isolated nucleotide sequence of Claim 3 which encodes the amino acid 
sequence of SEQ ID No: 1 1 . 

5. The isolated nucleotide sequence of Claim 3 having the sequence of 
SEQ IDNo:8. 

6. A replicable expression vector comprising a nucleotide sequence encoding a 
protein having the biological activity of SEQ ID No: 1 1 . 

7. The replicable expression vector of Claim 6 wherein the nucleotide sequence 
comprises the sequence of SEQ ID No: 8. 

8. A host cell comprising a vector of Claim 6. 

9. A host cell comprising a vector of Claim 7. 

10. A method of enhancing the production of limonene synthase in a suitable host 
cell comprising introducing into the host cell an expression vector of Claim 6 under conditions 
enabling expression of the protein in the host cell. 
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pLC5 . 2 AG AG AG AG AG AG G AAG G AAAG AT T AAT C 2 8 

pLC4.1 gagag a c 

pLC8.5 attaat cctagaaaaacat a c 

pLC5 - 2 ATGGCTCTCAAAGTGTTAAGTGTTGCAACTCAAATGGCGATTCCTAGCAA 7 8 

pLC4.1 

pLC8.5 

MALKVLSVATQMAIPSN 17 



pLC5 . 2 CCTAACGACATGTCTTCAACCCTCACACTTCAAATCTTCTCCAAAACTGT 12 8 

pLC4.1 

pLC8 . 5 

LTTCLQPSHFKS SPKLL 34 



pLC5 . 2 TATCTAGCACTAACAGTAGTAGTCGGTCTCGCCTCCGTGTGTATTGCTCC 17 8 

pLC4.1 

pLC8.5 

pLClO.l c a 1 

SSTNSSSRSRLRVYCS 50 



pLC5 . 2 TCCTCGCAACTCACTACTGAAAGACGATCCGGAAACTACAACCCTTCTCG 22 8 

pLClO.l 

SSQLTTERRSGNYNPSR 67 



pLC5 . 2 TTGGGATGTCAACTTCATCCAATCGCTTCTCAGTGACTAT AAGGAGGA CA 27 8 

pLClO.l g 

WDVNFIQSLLSDYKEDK 84 



pLC5 . 2 AACACGTGATTAGGGCTTCTGAGCTGGTCACTTTGGTGAAGATGGAACTG 32 8 

pLClO.l 

HVIRASELVTLVKMEL 100 



pLC5 . 2 G AG AAAG AAAC G GAT C AAAT T C G AC AAC T T GAG T T GAT C GAT G AC T T G C A 37 8 

EKETDQIRQLELIDDLQ 117 

pLC5 . 2 GAGGATGGGGCTGTCCGATCATTT CC AAAATGAGTTCAAAGAAATCTTGT 4 2 8 

R M GLSDHFQNEFKEILS 134 

pLC5 . 2 CCTCTATATATCTCGACCATCACTATTACAAGAACCCTTTTCCAAAAGAA 4 7 8 

S I YLDHHYYKNPFPKE 150 

pLC5 . 2 GAAAGGGATCTCTACTCCACATCTCTTGCATTTAGGCTCCTCAGAGAACA 52 8 

ERDLYSTSLAFRLLREH 167 

pLC5 . 2 TGGTTTTCAAGTCGCACAAGAGGTATTCGATAGTTTCAAGAACGAGGAGG 57 8 

GFQVAQEVFDSFKNEEG 184 

pLC5 . 2 GTGAGTTCAAAGAAAGCCTTAGCGACGACACCAGAGGATTGTTGCAACTG 62 8 

EFKESLSDDTRGLLQL 200 

pLC5 - 2 TATGAAGCTTCCTTTCTGTTGACGGAAGGCGAAACCACGCTCGAGTCAGC 67 8 

YEAS FLLTEGETTLESA 217 
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pLC5 . 2 GAGGGAATTCGCCACCAAATTTTTGGAGGAAAAAGTGAACGAGGGTGGTG 72 8 

REFATKFLSEKVNE GGV 234 

pLC5 . 2 TTGATGGCGACCTTTTAACAAGAATCGCATATTCTTTGGACATCCCTCTT 77 8 

DGDLLTRIAYSLDI PL 250 

pLC5 . 2 CATTGGAGGATTAAAAGGCCAAATGCACCTGTGTGGATCGAATGGTATAG 82 8 

HWRI KRPNAPVWI EWYR 267 

pLC5 . 2 GAAGAGGCCCGACATGAATCCAGTAGTGTTGGAGCTTGCCATACTCGACT 87 8 

K R P D M NPVVLELAI LDL 284 

pLC5 . 2 TAAATATTGTTCAAGCACAATTTCAAGAAGAGCTCAAAGAATCCTTCAGG 92 8 

NIVQAQFQEELKES F R 300 

pLC5 . 2 TGGTGGAGAAATACTGGGTTTGTTGAGAAGCTGCCCTTCGCAAGGGATAG 97 8 

WWRNTGFVEKLPFARDR 317 

pLC5 . 2 ACTGGTGGAATGCTACTTTTGGAATACTGGGATCATCGAGCCACGTCAGC 102 8 

LVECYFWNTGIIEPRQH 334 

pLC5 . 2 ATGCAAGTGCAAGGATAATGATGGGCAAAGTCAACGCTCTGATTACGGTG 107 8 

A S A R I M M GKVNALITV 350 

pLC5 . 2 ATCGATGATATTTATGATGTCTATGGCACCTTAGAAGAACTCGAACAATT 112 8 

I DDIYDVYGTL E E L E Q F 367 

pLC5 . 2 CACTGACCTCATTCGAAGATGGGATATAAACTCAATCGACCAACTTCCCG 117 8 

TDLIRRWDINSIDQLPD 384 

pLC5 . 2 ATTACATGCAACTGTGCTTTCTTGCACTCAACAACTTCGTCGATGATACA 12 2 8 

YMQLC FLALNNFVDDT 400 

pLC5 . 2 TCGTACGATGTTATGAAGGAGAAAGGCGTCAACGTTATACCCTACCTGCG 12 7 8 

SY DVMKE KGVNVI P YLR 417 

pLC5 . 2 GC AAT CGTGGGTTG ATT TGGCGGATAAGT AT AT GGTAGAGGC AC GGTGGT 132 8 

QSWVDLADKYMVEARW F 434 

pLC5 . 2 TCTACGGCGGGCACAAACCAAGTTTGGAAGAGTATTTGGAGAACTCATGG 137 8 

YGGHKPSLEEYLENSW 450 

pLC5 . 2 CAGTCGATAAGTGGGCCCTGTATGTTAACGCACATATTCTTCCGAGTAAC 14 2 8 

QS ISGPCMLTHI FFRVT 467 

pLC5 . 2 AGATTCGTTCACAAAGGAGACCGTCGACAGTTTGTACAAATACCACGATT 14 7 8 

DSFTKETVDSLYKY HDL 484 

pLC5 . 2 TAGTTCGTTGGTCATCCTTCGTTCTGCGGCTTGCTGATGATTTGGGAACC 15 2 8 

VRWSSFVLRLADDLGT 500 
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pLC5 . 2 TCGGTGGAAGAGGTGAGCAGAGGGGATGTGCCGAAATCACTTCAGTGCTA 1578 

SVEEVSRGDVPKSLQCY 517 

i 

pLC5 . 2 CATGAGTGACTACAATGCATCGGAGGCGGAGGCGCGGAAGCACGTGAAAT 1628 

MSDYNASEAEARKHVKW 534 

pLC5 . 2 GGCTGATAGCGGAGGTGTGGAAGAAGATGAATGCGGAGAGGGTGTCGAAG 167 8 

LIAEVWKKMNAERVSK 550 

pLC5 . 2 G ATTCTC CAT TCGGC AAAGATTTT AT AGGATGTGCAGTTG ATT TAGGAAG 17 2 8 

DSP FGKDFI GCAVDLGR 567 

pLC5 . 2 G A.T GGC GC AG TT G AT GT ACC AT AAT G G AG ATGGG C AC GGC AC AC AAC AC C 17 7 8 

HAQLMYHNGDGHGTQHP 584 

pLC5 . 2 CTATTATACATCAACAAATGACCAGAACCTTATTCGAGCCCTTTGCATGA 18 2 8 

I I HQQMTRTLFE PFA 599 

pLC5 . 2 GAGATGATGACGAGCCATCGTTTACTTACTTAAATTCTACCAAAGTTTTT 18 7 8 

pLC5 . 2 CGAAGGCATAGTTCGTAATTTTTCAAGCACCAATAAATAAGGAGAATCGG 1 92 S 

pLC5 . 2 CTCAAACAAACGTGGCATTTGCCACCACGTGAGCACAAGGGAGAGTCTGT 197 8 

pLC5 . 2 CGTCGTTTATGGATGAACTATTCAATTTTTATGCATGTAATAATT AAGTT 2 0 2 8 

pLC5 . 2 CAAGTTCAAGAGCCTTCT GC AT ATTT AAC TATGTATTTGAATTTATCGAG 2 07 8 

pLC5 . 2 TGTGATTTTCTGTCTTTGGCAACATATATTTTTGTCATATGTGGCATCTT 212 8 

pLC5 . 2 AT TAT GAT AT CAT AC AG T GTT T AT GG AT GAT AT GAT AC TAT C AAAAAAAA 217 8 

pLC5.2 AAAA 2182 
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Castor Bean 1 MALPSAAMQSNPEKLNLFHRLSSLPTTSLEYGN . NRFPFFSSSAKSHFKK 4 9 

I . . . . : . : . I I I : I : : : - . I : | | • . | 

Spearmint 1 MALKVLSVATQ>3AIPSNLTTCLQPSHFKSSPKLLSSTNSSSRS 43 



Castor Bean 50 PTQACLSSTTHQEVRPLAYFPPTVWGNRFASLTFNPSEFESYDERVIVXK 99 

• ... | J • • | • . • | • j • •[ • | . | 

Spearmint 44 RLRVTfCSSSQLTTERRSGNYNPSRWDVNFIQSLLSDYKEDKHVIRASELiV 93 

Castor Bean 100 KKVKDILISSTSDSVETVILIDLL.CRLGVSYKFENDIEELLSKI FNSQ . . 147 

. ! I : : I : : Ml I I : I : I t I : I : : - I : I I . I : . : 

Spearmint 94 TLVK . MEI^KETDQIRQLELIDDLQI^GLSDHFQNEFKEILSSIYIJDHHY 142 

I I . . : - i : I I I : : | : | : | II : - | : . | | | . | | : : 

Tobacco 1 MLLATGRKLADTLNLIDIIERLGISYKFEKEIDEILDQIYNQK. . 43 



Castor Bean 148 . PDLVDEKECDLYTAAIVFRVFRQHGFKMSSDVFSKFKDSDGKFKESLRG 196 

- : .... I I ! I . . - : - I I : : I : i I I - : - : I I - . I I : . : I . | | | | | . : 
Spearmint 143 YKNPFPKEERDLYSTSIAFRIjLREHGFQVAQEVFDSFKNEEGEFKESIjSD 192 

- I . | | : . . . ] . | [| I [ : M I - : - - I : I . - I . : I : I . I I I I I - • 

Tobacco 44 . SN CNDLCTSALQFRLLRQHGFNISPEI FSKFQDENGKFKESLAS 87 



Castor Bean 197 DAKGMLSLFEASHLSVHGEDILEEAFAFTKDYLQSSAVE _ . LFPNLKRHI 244 

I . : I : I I : I I I I . . I I . . I I . I . I . . . : i : . . . 1 : . : | : 1 

Spearmint 193 DTRG1XQLYEAS FLLTEGETTLESAREFATKFLEEKVNEGGVDGDL.LTRI 242 

I . I I I - I I I I I : I - : : - - I I - I • I - i I I - - - : . : . | . . : 

Tobacco 88 DVLGLLNLYEASHVRTHADDILEDALAFSTIKLESAAPH . . LKSPLREQV 135 



Castor Bean 245 TNALEQPFHSGVPRLEARKFIDLYEADIECRNETLLEFAKLDYNRVQLLH 294 



Spearmint 243 AY S LD I PL.HWRI KRPNAPVWI EWY RKRPD . MNPWLELAILDLNIVQAQF 291 

. . . | : .||. : - I I - - - - : . : - I 1 : I : I i I : I : : I 

Tobacco 136 THALEQCLHKGVPRVETRFFISSIYDKEQSKNNVLLRFAKLDFNLLQMLH 185 



Castor Bean 2 95 QQELCQFSKWWKDLNLASDI PYARDRMAE IFFWAVAMY FEPDYAHTRMI I 34 4 



Spearmint 2 92 QEELKESFRWWRNTGFVEKLPFARDRLVECYFWNTGIIEPRQHASARIMM 341 

. : i I : Ml:: : I I . - I I : I I I I : I I I I I I - I : - - . I . . i I : I : 

Tobacco 186 KQELAQVSRWWKDLDFVTTLPYARDRWEC Y FWAL.GVY FEPQYSQARVML 2 3 5 



Castor Bean 345 AKWLLI SL> I DDT I DAY ATMEE T H I LAE AVARW DMS CLE KL P D YMKY I Y K 3 94 



Spearmint 342 GKVNALITVIDDIYDVYGTLEELEQFTDLIRRWDINSIDQLPDYMQLCFL 3 91 



Tobacco 23 6 VKTISMI SIVDDTFDAYGTVKELEAYTDAIQRWDINEIDRLPDYMKI S YK 2 85 
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Castor Bean 395 LLLNTFSEFEKELTAEGKSY SVKYGREAFQELVRGYYLEAVWRDEGKI PS 4 44 

||..:.:: I : . I I : . : : I . I : M I : I - I I 

Spearmint 392 ALNNF VPDTSYDV MKEKGVNVIFYLRQSW^ 441 

| : : : . | . : : . . : : : . . : . : : - . I II- 1 I - : I - - I - 

Tobacco 28 6 AILDLYKDYEKSLSSAGRSHIVCHAIERMKEWRNYNVESTWFIEGYMPP 3 35 

Castor Bean 445 FDDYLYNGSMTTGLPLVSTASFMGVQEITGLNEFQWLETNPKLSYASGAF 494 

: : : | | | : . . . : I : I I : I : i ... I I : . 

Spearmint 4 42 i^EYLENSWQSISGPCMLTHIFFRVTOSFTKETVDSLYKYHDLVRWSSFV 4 91 

: . | | | . | . : : : I : : : - • ■ • = 1 • ■ = • I : : 

Tobacco 336 VSEYLSNALATTTYYYLATTSYLGMKSATE . QDFEWLSKNPKILEASVII 384 

Castor Bean 4 95 IRLVNDLTSHVTEQQRGHVASCIDCYMNQHGVSKDEAVKILQKMATDCWK 54 4 

: | | . : | | . . | . I | | - | : . : : : I I I . : - - • I - • I I I : - — - = - I I 
Spearmint 4 92 LIU^DLGTSVEEVSRGDVPKSIiQCYMSDYN^^ 541 

| : || : | 1 | | | | : I . I I . I . I I . • • - - ---II 

Tobacco 3 85 CRVIDDTATYEVEKSRGQIATGIECCMRDYGISTKEAMAKFQNMAETAWK 434 

Castor Bean 545 EINEECM . RQSQVSVGHLMRIVNLARLTDVSYKYG . DGYTDSQQ . LKQFV 5 91 

. : | . | : : : I . - : : : | : | : | : . : : I . . II-.. : - : - i = 
Spearmint 542 KMNAERVSKD S PFGKDFI GC AVDLGRMAQiLMY . HNGDGHGTQHP II HQQM 590 

- | • : : . . | . : . : | : . . : : | : | : . : : I II 11 

Tobacco 435 DINEGLL . RPTPVSTEFLTPILNLARIVEVTYIHNLDGYTHPEKVLKPHI 483 

Castor Bean 592 KGLFVDPISI 601 

. : . : I : . 
Spearmint 591 TRTLFEPFA . 599 

. . | . : . : 

Tobacco 484 INLLVDSIKI 493 



Fig. 6B 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US94/12379 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) :C07H 21/04; C12N 1/16, 15/63; C12P 21/02 
US CL :536/23.2; 435/320.1, 255.2, 69.1 
According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 536/23.2; 435/320.1, 255.2, 69.1 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
CAS. MEDUNE, BIOSIS, APS, UEMBL, GENBANK 

search terms: limonene synthase, Mentha spicata, DNA, gene, vector, expression 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 1 



x.p 



y. P 



A, P 



C station of document, with indication, where appropriate , of the relevant passages 



Journal of Biological Chemistry, Volume 268, Number 31, 
issued 05 November 1993, S. M. Colby etal., "4S-Limonene 
synthase from the oil glands of spearmint (Mentha spicata)", 
pages 23016-23024, see entire document. 

Plant Physiology, Volume 105, Supplement 1, issued 1994, 
D. J. McGarvey et al. f "Overexpression and mutagenesis of 
4S-limonene synthase from spearmint (Mentha spicata)", 
page 89, abstract. 

Journal of Biological Chemistry, Volume 269, Number 6, 
issued 1.1 February 1994, T. J. Savage et al. "Monoterpene 
synthases of Pinus contorta and related conifers", pages 
4012-4020, see entire document. 



Relevant to claim No. 



1-10 



1-10 



1-10 



I x| Further documents are listed in the continuation of Box C. | | See patent family annex. 



"A" 

'E - 
"L" 

*0" 
.p. 



Special cstegortea of cited document! : 

document defioing the general rtmlc of the art which is not coojkderod 
to be of particular relevance 

earlier document pub lie bed on or after the international filing dale 

document which may throw doubto on priority cfaum(s) or which a 
c«ed to ratahliah the pub be* boo date of another citation or other 
special reason (as specified) 

document referring to an oral disclosure, use. exhibition or other 



"X* 



document published prior to the international filing dale but later man 
the pr i ority date churned 



later document published after the international filing date or priority 
date and not ai conflict with the application but cited to understand the 
principle or theory underlying the invention 

document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive itep 
when the document is taken alone 

document of particular relevance; the claimed invention cannot be 
eonsslered to involve an inventive step when the document h 
combined with one or more other such documents, such combination 
being obvious to s person mkill^A tn the art 

document member of the same patent family 



Date of the actual completion of the international search 
18 JANUARY 1995 


Date of mailing of the international search report 

09FEB 1995 


Name and mailing address of the ISA/US 
1 Commissioner of Patents and Trademark* 
Box PCT 

Washington, D.C. 20231 
1 Facsimile No. (703) 305-3230 


Authorized officer ^ t 

LISA ARTHUR ^fc^S?^* 

Telephone No. (703) 308-0196 ' 



Form PCT/1SA/210 (second sheet)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US94/12379 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Archives of Biochemistry and Biophysics, Volume 301, Number 
1, issued 15 February 1993, W. R. Alonso et al., "Production and 
characterization of polyclonal antibodies in rabbits to 4S-limonene 
synthase from spearmint (Mentha spicata)", pages 58-63, see 
entire document. 

Journal of Biological Chemistry, Volume 267, Number 11, issued 
15 April 1992, W. R. Alonso et al., "Purification of 4S-limonene 
synthase, a monoterpene cyclase from the glandular trichomes of 
peppermint (Mentha x piperita) and spearmint (Mentha spicata)", 
pages 7582-7587, see entire document. 

Archives of Biochemistry and Biophysics, Volume 299, Number 
1, issued 15 November 1992, J. I. M. Rajaonarivony et al., 
"Evidence for an essential histidine residue in 4S-limonene 
synthase and other terpene cyclases", pages 77-82, see entire 
document. 



1-10 



1-10 



1-10 



Form PCT/lSAy210 (continuation of second sheet)(July 1992)* 



